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ABSTRACT 


This thesis evaluates the capabilities of artificial 
neural networks i forecasting the "take-rates" of the 
Voluntary Separations Incentive/Special Separations Benefit 
(VSI/SSB) programs for male. Marine Corps enlisted personnel 
in the grades of E -5 and E-6. The Artificial Neural Networks 
models are compared with the forecasting abilities of a 
classical regression model. The data are taken from the 
Headquarters Marine Corps Enlisted Master File which contains 
military and personal background on each enlisted member of 
the United States Marine Corps. The classical regression 
model is a causal model constructed based upon the theory of 
occupational job choice. The neural network models are 
presented with all available data elements. Empirical results 
indicate that artificial neural networks provide forecasting 
results at least as good as, if not better than, those 
obtained using classical regression techniques. However, 
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1, INTRODUCTION 

A. BACKGROUND 

Prior to implementing a policy, government decision makers 
attempt to understand its potential impact. This is 
especially true in today's environment of tight fiscal 
constraints. The pressure to understand these implications is 
even greater in the Department of Defense due to the attention 
focused on the rising cost of the ‘'peace dividend.*' DoD 
policy analysts must be able to forecast accurately the cost 
of implementing new policy. 

When these policy changes are related to manpower and 
personnel, forecasting the impacts of policies becomes 
particularly complicated. This is because policy analysts 
must be able to predict the decisions of individuals. For 
example, if the United States Navy wishes to increase the re- 
enliJtment rate of boiler technicians, one of the tools 
available is the selective re-enlistment bonus (SRB). 
However, policy makers need to know how large a bonus is 
required to meet required personnel staffing levels. Re¬ 
enlistment is an individual choice and the factors affecting 
this choice vary from one individual to the next. 

Manpower and personnel policy analysts have several tools 
to assist them in forecasting the decisions of individuals. 
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Currently, econometric models based on classical regression 
techniques are the primary tool of choice. However, there is 
another tool emerging that has the potential to replace or 
augment classical regression techniques in forecasting 
personnel decisions. This tool, artificial neural networks 
(ANN), bases its design on the functions of the human brain. 
Current research ir. the manpower and personnel fields provides 
supporting evidence that artificial neural networks can yield 
better forecasting results than classical regression 
techniques in some applications. 

The potential applications for artificial neural networks 
are spread across many disciplines. Artificial neural 
networks are being employed in the medical industry for cancer 
research and illness diagnosis [Ref. l]; in the business world 
for stock market predictions [Ref. 2] and credit risk 
assessments [Ref. 3]; in criminology for fingerprint 
classification and handwriting analysis; and, in DoD, for 
target recognition and aircraft systems control. [Ref. 4] 

Artificial neural networks (ANNs) offer two benefits to 
the user [Ref. 5;pp, 3-5]. First, many of the assumptions 
associated with classical regression techniques are not needed 
when employing neural networks. ANN model developers claim 
that analysts do not have to worry about multicollinearity 
between independent variables, heteroskedasticity, or serial 
correlation. Additionally, the relationships between the 
independent and dependent variables do not have to be linear. 
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The second benefit of neural networks is in variable 
selection. There need not be an underlying theory to support 
the choice of independent variables. A researcher can apply 
his common sense in selecting independent variables. It is 
important to caution here that variables should still be 
chosen carefully as in the applicat n of classical regression 
techniques. However, supporters of ANN claim that users do 
not have to be theoretical experts in the field where they are 
applying ANNs. ANNs are designed to seek out patterns that 
are complex and not readily noticeable to the analyst. If a 
pattern does not exist between dependent and independent 
variables, then the network "learns" to disregard the 
independent variable's values. 

This thesis examines the applicability of ANNs to a 
current manpower related issue, the downsizing of the active 
duty force. Manpower and personnel policy makers are 
implementing separation incentive programs to entice service 
members to leave the service. The United States Marine Corps 
currently utilizes two of these programs, the Voluntary 
Separations Incentive (VSI) ard the Special Separations Bonus 
(SSB), to shape its force structure. Accurately forecasting 
which individuals will accept these programs is paramount for 
two reasons. First, in budgeting limited financial resources, 
it is fc..j 3 ential to predetermine, as well as possible, the 
number of service members who will select the programs once 
they are offered them. The second reason centers on the 
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eligible population for the program. Since the programs are 
designed to shape the force structure, the categories of 
personnel targeted to receive these programs must be carefully 
selected. Further complications to the implementation process 
arise from the congressional mandate requiring that all those 
who apply for the programs are accepted. 

This thesis provides information for policy makers to use 
in reaching more informed decisions about the implementation 
of the VSI and SSB programs. Additionally, it adds another 
tool to the arsenal of policy analysts when attempting to 
determine the impacts of policy changes. 

B. THESIS OBJECTIVES 

This thesis has two objectives. The first is an 
evaluation of the effectiveness of ANNs as a tool for manpower 
analysts. ANNs are currently receiving a great deal of 
positive publicity [Ref. 6;p. 3]. Developers and researchers 
are treating neural networks as one of the most important 
advances in computer technology in recent years. However, 
before it can be effectively utilized as a forecasting tool 
for manpower and personnel decisions, it should be examined 
with a critical eye and evaluated under different conditions. 

The other objective of this thesis is to examine several 
of the possible combinations of choices that can be made when 
constructing a neural network. Builders of ANNs must possess 
a knowledge of the different facets of a network and make 
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conscious decisions as "to the form of the network. The 
architecture, topology, learning rule, and transfer functions 
of a network must be clearly defined by the developer. There 
are many possible combinations and each produces a slightly 
different network. How these decisions are reached is 
currently the topic of much of th'' research surrounding neural 
networks. 

C. RESEARCH QUESTIONS 

This thesis attempts to answer the following questions. 

1. Do artificial neural networks provide better 
forecasting results than classical regression techniques 
in forecasting the "take—rates" of voluntary incentive 
programs offered to Marine Corps enlisted personnel? 

2. Which artificial neural network "topology" provides 
the "best" forecasts of these "take-rates"? 

D. SCOPE AND LIMITATIONS 

This thesis has two limitations. The first concerns the 
data set. This study examines only Marine Corps enlisted 
personnel offered the separation incentive programs. Officers 
are eligible for the programs but the number of applicants 
provide too small a sample. 

The other limitation deals with the algorithms underlying 
the classification processes that are utilized by an ANN. 
These algorithms are complex and quite lengthy. An 
examination of the mathematical formulas that support ANN is 
beyond the scope of this thesis. 
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E. RESEARCH METHODOLOGY 


This thesis uses a data set developed by members of the 
Enlisted Career Force Planning Cell at Headquarters, Marine 
Corps. The data are taken from two places. A list of all 
enlisted members who voluntarily separated from the Marine 
Corps under the incentive programs is combined with 
information on each individual available from the Marine Corps 
Enlisted Master File. The data are from fiscal year 1993. 

These data are used to construct three models to forecast 
the "take-rates" of the separations programs offered. The 
first model is a variation of a binomial LOGIT model which was 
developed by Noblit (1993). It contains independent variables 
that represent the lifetime earnings potential of an 
individual along with proxy variables for his ability, 
preferences, and family background. 

The other two models are ANN models constructed using a 
commercially available software package, Neura]Works 
Professional II/Plus developed by NeuralWare Incorporated. 
This software is chosen because of its flexibility in creating 
a neural network and its user friendly interface. 

One neural network model is constructed and adjusted to 
reach its peak forecasting potential. The topology of the 
initial model contains one hidden layer with thirty hidden 
neurons. The other ANN model is similar in design except that 
it contains two hidden layers, one with twenty hidden neurons 
and the other with ten hidden neurons. The two ANN models are 
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coEipared using criteria such as the correctly predicted takers 
and correctly predicted non-takers of the programs. 


F. ORG&MI2ATIONAL DESCRIPTION 

The thesis is divided into three parts. The first part is 
comprised of Chapters II and III. These two chapters lay the 
foundation for the rest of the thesis. Chapter II provides 
background information on the functions of both types of 
neural networks, biological and artificial. Chapter II also 
contains a section which describes the decisions designers 
face when constructing an ANN. 

A review of current literature on ANNs is presented to 
demonstrate the nature of its components. Individuals 
inexperienced with neural networks can gather the necessary 
knowledge from the literature review to comprehend the steps 
involved in the building a neural network. 

The other aspect of the literature review builds the 
conceptual framework fcr the decision to take a separations 
incentive. Literature dealing with the theory of occupational 
choice is reviewed. Although the separations px-ograms are 
relatively new, there exist many studies on the impacts of the 
SRB. The goal of the SRB is to entice military members to 
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occupations. In essence, the separation incentives are simply 
the SKri in reverse. 

Chapter III describes the VSI and SSB programs along with 
the implementation plan used by the Marine Corps in employing 
these two programs. This is followed by a discussion of the 
theoretical framework that Noblit (1993) uses in developing 
his model. The economic theory of occupational choice 
establishes the structure for the model. The chapter 
concludes with a description and critique of Noblit's model. 

The middle section of the thesis, Chapters IV, V, and VI, 
is analytical in nature. Chapter IV describes the data used 
in this comparison a,id the development of the three analytical 
models. Chapter V presents the results of the classical 
regression model which is compared to the later neural network 
results. Chapter VI furnishes the results of the two neural 
networks employed in this comparison. Additionally, part of 
the chapter is devoted to discussing the choices made during 
the development of the ANN and the reasoning behind the 
selection of a network's characteristics. 

The last section of the thesis cont; ins Chapters VII and 
VIII. Chapter VII compares the forecasting capabilities of 
all three models and their usefulness to military manpower and 
personnel policy analysts. Chapter VIII contains concluding 
comments and recommendations for further research. 





II. NEURAL NETWORKS 


A. INTRODUCTION 

Forecasting the future is a difficult task. However, it 
is a necessity if manpower and personnel planners are to 
determine the effects of policy decisions on the shape and 
size of the military force. Planners must be able to 
determine the impacts of advertising on recruitment and 
bonuses on reenlistment. It is not enough to know increases 
in advertising and bonuses will increase enlistments and 
reenlistments. To derive the greatest benefit from 
instituting manpower and personnel policies, the incremental 
effects of a policy must be determined. 

Just as a carpenter relies on his tools to build a house, 
manpower and personnel planners rely on tools to forecast the 
future consequences of policy changes. Among these tools are 
econometric models based upon regression analysis. Recently, 
a new tool for forecasting the impacts of manpower policy 
changes has emerged. The design of this new tool, artificial 
neural networks, is based on the human brain. Recent studies 
provide supporting evidence that artificial neural networks 
produce the same or better results than classical regression 
techniques while allowing the researcher to relax the 
assumptions required when using regression. 











This chapter includes a brief description of the elements 
of neural networks and a review of the options a designer must 
choose among when developing an artificial neural network as 
well as a section concerning special data requirements. The 
chapter concludes with a discussion of several techniques 
available to measure the success of forecasting tools. 

B. NEURAL NETWORK COMPONENTS 
1. Background 

A neural network is a collection of neurons working 
together to produce a result based upon information provided 
to the network. Neural networks are either biological or 
artificial. The brain of an animal is an example of a 
biological neural network. The six senses of an animal allow 
it to detect important aspects of its surroundings and react 
to them accordingly. Current theory labels this ability to 
react to a changing environment as thinking. [Ref. 6:pp. 9-3.1] 

Artificial neural networks are an attempt to simulate 
biological neural networks. Biological neural networks are 
efficient at associative reasoning, learning, and thought. 
Artificial neural network designers hope to capture these 
qualities in order to enable computers to simulate more 
closely the processes of the brain. This will allow computers 
to adjust their functions to match the ever-changing 
environment in which they operate without having to undergo 
the complex steps of reprogramming. [Ref. 6;pp. 9-11] 
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2. Biological Neurons 


The basic element of a neural network is the neuron. 
The biological neuron is a complex cell comprised of four 
basic components. The soma is the center of the cell. The 
dendrites are the input channels to the soma. The aszon is the 
output channel from the soma. The synapses are the 
connections between different neurons within the network. It 
is important to note that synapses are not part of the cell 
but regions between the axon of a sending neuron and the 
dendrites of a receiving neuron. [Ref. 6:pp. 30-31] 

Biological neurons use a complicated and not yet fully 
understood process in their operations. However, the basics 
are simple to comprehend. The neuron receives an input from 
a synapses. This input travels through the dendrites to the 
soma. Inside the soma, the input accumulates over time. When 
the soma decides it has received enough inputs, it activates 
and sends a signal to other neurons. The axons carry this 
signal to the synapses which connect with the dendrites of 
other neurons.[Ref. 7:pp. 3-5] 

3. Artificial Neurons 

a. Overview Of The Imitation Process 

Artificial neurons attempt to imitate this process. 
The process in an artificial neuron consists of four phases. 
During the first phase, the processing element (in an 
artificial neural network, a processing element is analogous 
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to the biological neuron), receives inputs from either the 
outside world or other processing elements through input paths 
(dendrites). Prior to entering the processing element, these 
inputs receive a weight corresponding to the strength of their 
signal. Once the processing element receives all of the 
weighted inputs, these values move on to the next phase. 

[Ref. 8:pp. 4-5] 

The second phase of the process is activation of 
the processing element. During this phase, the processing 
element decides how to treat the weighted inputs. An 
activation function transforms all the weighted inputs into a 
single value, the activation value. In simple processing 
elements, the activation value is the sum of all of the 
weighted inputs. In more complex processing elements, the 
activation value can be a function of the weighted inputs and 
the previous state of the processing element. This permits 
the processing element to "self-excite", allowing previous 
inputs to have a continuing effect. Once the processing 
element determines the activation value this value moves to 
the next phase.[Ref. 6:pp. 84-85] 

During the third phase, the processing element 
converts the activation value into the output of the neuron. 
The transfer function of the processing element performs the 
conversion process of the activation value into the output 
value. Transfer functions can take on one of several forms. 
In developing an artificial neural network, the form of the 
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transfer function is left up to the developer of the neural 

network. [B f. 6;pp. 84-85] 

The final phase of the process is the output phase. 
The processing element sends the value of its output as 
determined by the transfer function's action on the activation 
value to one of two places. The output can be sent to other 
processing elements within the network or to the outside 
environment as a product of the network. Figure 1 contains a 
breakdown of the four phases. 

b. Types Of Transfer Functions 

The threshold function and the saturation function 
are two types of transfer functions. The threshold function 
is an "all or nothing" approach. The activation value must be 
above some minimum value, the threshold, for the processing 
element to contribute anything to the whole network. In 
networks that use the threshold function, if the activation 
value is below the threshold, the processing element's output 
is zero. Otherwise the processing element's output is one. 
[Ref. 6:pp. 85-86] 

The output values of a saturation function are 
continuous, unlike those of a threshold function. However, in 
a saturation function, excitation above some maximum firing 
level has minimal additional effect. Two commonly used 
saturation function are the sigmoid function and the 
hyperbolic tangent function. Figure 2 contains a graph of 
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Figure 1. Phases of the Aitifical Neuron Process 


Source: Russell (1993) 

each type of transfer function. These functions are 
advantageous because they provide a continuous, monotonia 
value for any input and are bounded by a high and low value. 
[Ref. 6:pp. 85-85] 

4 . Layers 

A neural network consists of many neurons, biological 
or artificial, organized into groups defined as layers. There 
are three types of layers. The first, the input layer, 
contains neurons that receive inputs from the outside world. 
In biological neural networks, these input neurons are located 
in the eyes, ears, and other sensory organs of animals. Input 
neurons in artificial neural networks receive data from the 
keyboard or other computer paraphernalia. [Ref. 8;pp. 4-5] 
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Sigmoid Function 


Hyperbolic H) Function 


Figure 2. Common Saturation Functions 

Soirot: Russ«l (1093) 

A second type of layer is the output layer. In the 
output layer, output neurons send information back to the 
outside world. In a human being, this information is 
transmitted through muscle activity such as voice activation, 
emotional responses, or body movement. Output neurons in the 
ar'tificial world send their signals via computer monitors or 

other output devices. [Ref. 6;p. 79] 

If a neuron is not in the input or output layer, it 
falls into the hidden layer. Neural networks can have one 
hidden layer, no hidden layer, or thousands of hidden layers. 
What happens within the hidden layer is not observable to the 
outside world. The neurons of the hidden layer are connected 
only to other neurons of the network. [Ref. 6:p. 79] 






5. Nstworks 


A neuron is the basic element of a neural network. 
Neurons are organized in groups called layers. These layers 
form networks. Biological neurons make up biological neural 
networks. The human brain is an example of a biological 
neural network. It contains approximately one hundred billion 
neurons each connected to approximately ten thousand other 
neurons. [Ref. 9:pp. 3-4] 

This vast network functions in parallel which means 
that many connections can occur simultaneously. This allows 
the brain to make roughly ten thousand trillion connections 
per second. The Cray Computer, the fastest computer developed 
to date, is serial which means it can only make one connection 
at a time. The Cray can make fifty million interconnections 
per second. The end result is that the human brain is about 
twenty million times faster than the "fastest" serial 
computer. [Ref. 9:pp. 3-4] 

The speed and parallel nature of the human brain allow 
it to be excellent at problem solving or pattern recognition 
but slow at complicated mathematical problems. The serial 
nature of computers facilitates the solving of mathematical 
problems but prohibits the computer from excelling at pattern 
recognition or pattern solving. Artificial neural networks 
attempt to bridge this gap. [Ref, 6:p. 14] 
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a. Network Types 


There are many different kinds of artificial neural 
networks. However, each type falls into one of two 
categories, feed-forward or feedback networks. The 
distinction between the two centers on whether or not the 
network pays attention to its results. If a network uses its 
previous result in determining the succeeding output, then the 
network is classified as a feedback network. On the other 
hand, feed-forward networks are so-called because the output 
of each processing element flows forward in the network and 
does not interact with neurons in either previous layers oi. 

the same layer. [Ref. 6:p. 148] 

Feedback networks often are constructed without a 
hidden layer. Every neuron is allowed to connect to every 
other neuron and to itself. In order for a feedback network 
to produce a result, it must continually adjust the processing 
element weights until the weights reach a steady state. It is 
impossible to predict how long this will take. However, most 
sygtems are able to accomplish the adjustments after a few 
iterations. Examples of feedback networks include Hopfield 
Networks, Content-Addressable Memories , and Traveling-Salesman 
Problem Networks. [Ref. 6;p. 153] 

Feed-forward networks produce results in the 
following manner. Each processing element receives input from 
processing elements in the previous layer. In the input layer 
these inputs come from the outside world. In the hidden 
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layer, these inputs come from the input layer or another 
hidden layer. In the output layer, these inputs come from a 
hidden layer. 

The processing element determines the weight of the 
input, calculates an activation value, and then decides on an 
output. This output is passed on to the next layer. The 
output flows from the input layer to the hidden layer(s) to 
the output layer to the outside world. In essence, feed¬ 
forward networks are hierarchial. Processing elements connect 
only with processing elements in the next layer. Unlike the 
processing elements of feedback networks, feed-forward network 
processing elements do not connect with themselves or 
processing elements in the same layer. Figure 3 contains a 
diagram of a feed-forward network with three inputs, two 
hidden layers each containing two processing element, and one 
output layer. [Ref. 6:p. 148-150] 
b. Network Learning 

Determining the weight of the connection between 
each processing element is the biggest feat that networks must 
accomplish. Networks accomplish this by a process called 
learning. Learning is the process by which the connection 
weights are adjusted to minimize the error between the desired 
output and the actual output. Initially, the weights of each 
connection are set randomly. The network is then presented 
with training pairs. A training pair is one observation from 
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Figure 3. Feed-Forward Neural Network 

Soucs: (1^3) 

the data set containing all input values and the desired 
output. Based on the input, the network predicts an output 
value. This predicted value is then compared to the actual 
output. [Ref. 6:p. 149] 

Differences between the actual and predicted values 
cause the network to adjust the connection weights. The 
weights are adjusted by a learning rule. The learning rule 
determines how the connection weights are adjusted throughout 
the network. Popular learning rule algorithms include 
Ferceptron, Adaline, Madeline, Boltzman, Counter Propagation, 
and Back Propagation. The mechanics of these algorithms are 
beyond the scope of this thesis. Detailed descriptions and 






the underlying mathematical formulas may be found in 
Wassermann (1989), Beale and Jackson (1991), and Muller and 
Reinhardt (1990). [Ref. 6:p. 151] 

One of the largest differences between feedback and 
feed-forward networks is that feedback networks do not learn, 
they are constructed. F'eed-forward networks learn based upon 
the presentation of input and the desired output. Feed¬ 
forward networks simulate learned behavior of an animal while 
feedback networks simulate instinctive behavior. [Ref. 6:p.l53] 

C. NETWORK DESIGN 

Researchers who use artificial neural networks face many 
decisions in the construction of a artificial neural network. 
What is the appropriate network architecture? How many hidden 
layers should the network contain? What transfer function is 
suitable for the network? What are the best learning rates? 
What is the appropriate learning rule for the network? These 
questions only scratch the surface of the decisions faced by 
researchers when building an artificial neural network. This 
section examines several of the decisions faced by artificial 
neural network developers. Although researchers devote much 
time and energy to determining the appropriate design for an 
artificial neural network, there are endless possibilities. 

1. Network Architecture 

Network architecture refers to the type of network to 
be used. Is a feedback or a feed-forward design the most 
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appropriate? To answer this question, the network designer 
must understand the applications for which the network will be 
used. Military manpower and personnel planners attempt to 
forecast, or predict, the decisions military service members 
will make. Feed-forward neural networks are the most 
successful for this application. [Ref. 8:p. 36] 

Klimasauskas (1991) suggests that back-propagation is 
the best architecture for neural networks designed for 
forecasting or classification. Forecasting stock market 
moves, predicting future job performance, bankruptcy 
forecasting, and credit risk determination are several of the 
uses for back-propagation architecture. [Ref. 3:p. 29] 

Yoon and Swales (1991) used neural networks to predict 
stock price performance. The researchers collected data from 
two sources used regularly by investors, Business Week and 
Fortune magazines. The artificial network design used a back- 
propagation architecture. The results of the neural network 
model were compared to forecasts made using multiple 
discriminate analysis. Yoon and Swales concluded that neural 
networks using a back-propagation architecture significantly 
improve the predictability of stock price performance. 

[Ref. 2:pp. 156-162] 

Raghupathi, Schkade, and Raju (1991) employed neural 
networks to forecast the bankruptcy rate of private companies. 
Data on 102 companies, 51 that filed for bankruptcy and 51 
that did not, were used to train a neural network with a back- 













propagation architecture. The authors concluded that neural 
networks provide a suitable model to predict bankruptcies. 
[Ref. 10:pp. 147-155] 

Wiggins, Engquist, and Looper (1992) compared neural 
networks and classical regression techniques in forecasting 
the re-enlistment rates of Air Force Enlisted personnel. 
Several neural network models were developed using three 
different architectures: back-propagation, probabilistic 
neural network (PNN), and learning vector quantization (LVQ), 
The results of these three models were compared to the results 
of a probit model. Wiggins et al. concluded that the back- 
propagation neural network provided the best results, 
especially in cases with large sample sizes. [Ref. li;p. 52] 

At this point, it is important to clarify an issue. 
Earlier it was stated that popular learning rule algorithms 
include back-propagation. Many times the network architecture 
and the learning algorithms the network use share the same 
name. This adds some confusion. However, keep in mind that 
the name of a network can describe how the network functions. 

2. Network Topology 

The topology of a neural network refers to the number 
of layers and quantity of processing elements within each 
layer. An artificial neural network has one input and one 
output layer. The input layer contains one processing element 
for each input variable. The number of input variables in a 
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network depends upon the configuration of the input data. 
Later in this chapter, further discussion addresses changing 
sample data into the appropriate form for neural networks. 

The output layer is similar to the input layer. One 
processing element in the output layer is required for every 
piece of output data. Output data faces the same constraints 
as the input data. Therefore, a transformation back to 
original form may be required for proper interpretation. 

Developers of artificial neural networks must decide 
on the number of hidden layers in a network and the quantity 
of processing elements in each hidden layer. Klimasauskas 
(1991) states that most research problems employing a neural 
network technique can be solved vising a topology with one 
hidden layer. If better performance is expected with more 
than one hidden layer, it is best to connect each hidden layer 
to all prior layers. This facilitates learning and increases 
the ability of the network to recognize complex patterns. 
However, if traditional statistical methods have provided 
satisfactory results, then connecting the input layer directly 
to the output layer is often very effective. [Ref. 12:p. 20] 

To summarize this concept, start without a hidden 
layer and examine the results. Then add one hidden layer and 
determine which network provides the best results. Adding 
additional layers should not improve network capabilities but 
may if the data contain complex patterns. 
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Lapedes (1989) furnishes an argument for utilizing no 
more than two hidden layers when using a back-propagation 
network. Using calculus, he supports the hypothesis that 
increasing the number of hidden layers provides no increase in 
the learning capabilities of an artificial neural 
network.[Ref. 13:p. 456] 

Once the decision about the number of hidden layers is 
reached, the designer must determine the appropriate quantity 
of processing elements in each hidden layer. If the number of 
processing elements in each layer is too large, the network 
will not learn the generalities of the input data but, rather, 
will memorize them and the network will be unable to handle 
inputs it has not already seen. [Ref. 14:p. 18] 

On the other hand, if the number of processing 
elements is too small, the amount of time required to train 
the network drastically increases. An excellent starting 
point is five processing elements in each hidden layer. 

[Ref. 14:p, 18] 

Since the decision about network tocology is more an 
art than a science, the best alternative is to start with a 
simple design, say one hidden layer with five processing 
elements. After the network has learned to a sufficient 
degree, adjust the number of layers and processing elements to 
determine if any improvement in results is detected. 
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3. KetworK Transfer Functions 

The transfer function within a processing element 
transforms the activation value to the output value of the 
processing element. The activation value is a function of the 
weighted inputs to the processing element. The two most often 
employed transfer functions are the hyperbolic tangent 
function and the sigmoid function. [Ref. 12!p. 23] Other 
choices include a linear function, a sine function, or a 
threshold function. The function choice is dependent upon the 
nature of the input data and what the network is trying to 
learn. 

When employing neural networks in forecasting 
problems, many of the input variables will be dummy variables 
egual to one if a condition exists and zero if i.t does not 
exist. In these situations, the best choice of a transfer 
function is the hyperbolic tangent or sigmoid function. The 
choice of which of these two to use depends on the forecasting 
goal. [Ref. 12:p. 20] 

Forecasting goals fall into one of two categories, 
average or exceptional behavior. Average behavior is defined 
such that the probabilities of choosing from a list of 
alternatives are roughly similar. For example, in forecasting 
the response rate of surveys, the chance a person will respond 
or will not respond are fairly similar. In stock prices 
predictions and bankruptcy forecasting, the goal is to find 
the exceptions. Researchers attempt to find the stocks that 
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will increase greatly in price or the company that stands the 
highest probability of failure. In forecasting exceptional 
behavior, the hyperbolic tangent function works best. In 
average behavior cases, the sigmoid function works best. 
[Ref. 12:p. 20] 

It is important to note that each processing element, 
regardless of its layer location, must possess a transfer 
function. It is not necessary that every processing element 
in the network have the same transfer function. Several 
different types of transfer functions may be used in a single 
network. For example, the neurons of the input layer have 
only one input each, the value of the input. Transforming 
these values using a hyperbolic tangent or sigmoid function 
serves no purpose. Therefore, the use of a linear transfer 
function is appropriate at this level. [Ref. 8:pp. 106-108] 
4. Network Learning Rules 

Learning is the process a network undergoes to match 
the desired output to the actual output. By adjusting the 
weights of each connection between the processing elements, a 
network accomplishes learning. Learning rules specify how 
connection weights are adjusted. Widrow and Hoff (1960) 
developed a learning rule, the Delta-Rule, which is the basis 
for several of the well known learning rules used extensively 
today. [Ref. 5:pp. 15-24] 
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a. The Delta Rule 

The Delta-Rule states that the change in connection 
weight Wj is equal to the learning rate times the error times 
the output from the processing element Xj. [Ref. 15;p. 218] 

A Wj=LearningRa ce* Ezrox *x^ 

The learning rate is a value between zero and one. 
The developer of the network determines tht learning rate and 
can adjust it as the network continues learning. An iteration 
is the presentation of one training set, both the independent 
and dependent variables, to the network. Initial iterations 
through the data should be done with a relatively high 
learning rate. As the number of iterations increase, the 
learning rate should be decreased. This allows the network to 
reach a global error minimum. If the learning rate does not 
decrease as the iterations increase, the error will oscillate 
around the minimum achievable error for the network. 

[Ref. 5:p. 17] 

The error is the difference between the actual 
output of the network based on the inputs and the desired 
output that is associated with those inputs. Each connection 
weight is assigned an equal part of the total error. If there 
are N weights, then the error term is calculated as follows: 









The N + 1 term accounts for the weight of the bias element 
contained in the network. [Ref. 5;p. 16] 

Widrow and Huff developed this learning rule for 
the simple, single-layered neural network called Perceptrous. 
Rumelhart, McCelland, and Williams (1986) developed a more 
complicated learning rule based upon the Widrow-Huff rule. 
This new development was designed for complex neural networks 
that employ multiple layers. 

Jb. Variations Of The Delta Rule 

Two variations to the delta rule, the Cumulative 
Delta Rule and the Normalized Cumulative Delta Rule, were 
developed to increase the efficiency of neural networks. 
Before reviewing these rules, two new terms need introduction, 
epoch size and momentum. 

Epoch size is the number of training pairs that are 
presented to the network before adjusting the connection 
weights. Simply put, the epoch size is the number of 
iterations the network undergoes before any of the connection 
weights are adjusted. [Ref, 5:p. 30] 

Momentum is a factor added to the delta rule to 
allow the network to minimize the error across the entire 
network and not become "stuck" in a local minimum. The factor 
is based on previous changes in the connection weights and is 
between zero and one. If the weight changes are large, then 
the momentum tends to be large but as the changes become 
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Momentum assists in 


smaller, so does the momentum factor, 
speeding up the convergence to a minimum error.[Ref. 16:p. 92] 
The Cumulative Delta Rule updates the weights after 
each epoch. Cumulative weight changes are stored and then 
propagated through the network after the presentation of a 
builder defined number of iterations or epoch size. These 
weight changes will contain both increases and decreases in 
connection weights thus allowing for a net, cumulated effect 
to occur on the weights. The benefit here is that this 
decreases processing time because the error is not propagated 
back through the network after each iteration. [Ref. 5:p. 30] 
The Normalized Cumulative Delta Rule is similar to 
the cumulative Delta Rule in the determination of the change 
in the connection weights. The difference is that the changes 
in the weights are normalized. To normalize the weight 
changes, the learning rule algorithms transforms the 
connection weight vector into a unity length vector dy 
dividing it by its magnitude. The advantage to making this 
modification is that it further reduces the training time of 

the network. [Ref. 16:p. 115] 

The Delta-Rule and its two variations, the 

Cumulative Delta and Normalized Cumulative Delta Rules, are 
possible choices for the learning rule used by an artificial 
neural network. Of these three rules, network designers favor 
using the Normalized Cumulative Delta Rule when working with 
back-propagation networks. (Ref. 12:p. 22] Of the three 


29 








choices, it is the quickest and requires the least amount of 
CPU processing time. 

D. DATA CONSIDERATIONS 

The type of data fed to the network is as important as the 
network development. As in classical regression techniques, 
all of the data presented to a model must be in numerical 
form. Neural networks put a further limitation on the data. 
Networks work best with data in the range of -l to +l or 0 to 
1. Each input variable for the neural network model must be 
converted to this form. [Ref. 3:p. 28] 

In modeling manpower and personnel decisions, independent 
variables are usually either continuous numeric variables or 
"dummy" variables. The "dummy" variable is the easiest for a 
neural network to handle because it has been previously coded 
as a zero or a one. The value of the variable is one if a 
certain characteristic is present and zero if it is not. For 
example, if an individual is a male, the variable MALE is set 
equal to one. If the individual is a female, the variable 
MALE is set to zero. 

Continuous numerical variables pose a slightly more 
complicated problem. These variables must be re-scaled to fit 
in the range from -1 to 1 or 0 to 1. Although this is not 
extremely difficult, it can be time consuming unless the 
neural network software being utilized has the capability of 
completing this transformation transparent to the operator. 
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Fortunately, NeuralWorks Professional _II/Pl.us possess this 

capability. 

Output falls under the same restrictions as the input 
data. since the decision to stay or leave the military is 
simply a choice between only two items, then a dummy variable 
can be used ae the output variable. The variable can be 
assigned a value of one if the observation left the service 

and a zero otherwise. [Ref. 3:p. 28] 

In processing the output data for delivery to the network, 
NeuralWorks scales the values into a smaller range. The range 
for a two condition dummy output variable is either -0.8 to 
0.8 or 0.2 to 0.8. The purpose behind this is to allow the 
network to determine results for new input data outside the 
range of the original data. 

E. MEASURES OF SUCCESS 

In making a comparison between models to determine which 
is the "best" at forecasting and provides the "best" fit to 
the data, researchers must develop some type of measurement 
criteria. This section reviews current measures. 

Hill, O'Conner, and Remus (1991) compared neural networks 
and classical regression techniques using time series data. 
A variety of models were created using v;eekly, monthly, and 
yearly time series data. In determining the best model for 
forecasting using time series data. Hill et al. used the 
absolute percentage forecast error (APE). This measure was 
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chosen based on its applicability to time series data. 
Additionally, APE is a measure used quite often and appears to 
be the most robust measure available. [Ref. I7:p. 12] 

APE is calculated based upon the predicted value of the 
forecast, the actual value of the forecast, and the total 
number of residuals. The formula to determine the APE of a 
certain model is: 

AP£'=i*(V (^)) *100 

Where N is the number of residuals, P; is the predicted value, 
and Aj is the actual value of the forecast. [Ref. 7:p. 55] 

Sharda and Wilson (1993) review several different measures 
of success when evaluating model performance. The authors do 
not specify which is the "best" metric to use. They do 
conclude, however, that more than one measure should be used 
when comparing forecasting techniques. 

Two of the metrics proposed by Sharda and Wilson are 
easily calculated and effective measures for classification 
experiments. Since this study focuses on "classifying" a 
military person as a "taker" or "non-taker" of the separations 
incentive programs, the classification metrics are the best 
suxted for thxs research. 

Prior to examining the actual measures, it is important to 
classify the possible results of a forecasting model. 
Forecasting models have the potential to provide four results: 
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'true positive, true negative, false positive, or false 
negative. If the model predicts the observation will take the 
program and he does, this is a true positive. If the model 
predicts the observation will not take the program and he does 
not, this is a true negative. A false positive occurs when 
the model predicts the observation will take the program but 
in reality he does not. A false negative occurs when the 
model predicts the observation will not take the program but 
he does. 

These four possibilities are represented by the following 
symbols : 

O, =■■ True positive 

(predicted taker and actual taker) 

On ■- True negative 

(predicted non-taker and actual non-taker) 

Ft = False positive 

(predicted taker and actual non-taker) 

Fn = False negative 

(predicted non-taker and actual taker) 

The first metric, M,, provides a measure of success in 
predicting the correct category. Mj is defined in two parts. 
The first measure is the probability of correctly predicting 
a ’’taker" (M„) and the second is the probability of 
predicting a "non-raker" (”in) • 
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The values of these measures are defined by the following 


equations: 




Or 


M. 


in 


AND 

_ On 

0.+F„ 


The other measure, is an inefficiency measure. It 
symbolizes false classifications. Like Mj, Mj is composed of 
two values, one for the model's ability to forecast "takers" 
and the other to represent the model's ability to forecast 
"non-takers". The variables are the same as defined for M,. 
The two values of are; 


AND 





The goal of any forecasting model would be to maximize M, 
and minimize M^. [Ref. 18;pp. 649-656] 

Wiggins, Engquist, and Looper (1992) use a simulated 
when comparing the results of an artificial neural network 
model to previously developed models using classical 
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regression techniques. The benefit of this measure is its 
ability to identify models that do not provide better 
estimates than provided by examining the sample means. The 
simulated will produce a negative value in these instances. 
In instances where perfect prediction occurs, the simulated R 
takes on the value of one similar to the coefficient of 
determination used in measuring the effectiveness of classical 
regression models. [Ref. ll:pp. 14-15] 

The simulated R^ formula is: 




Where: 

Pj is the predicted take-rate for observation i, 

Aj is the actual decision made by observation i, and 
AVG is the average take-rate over the whole sample. 


Prior to developing a neural network, measures of success 
must be determined. This is critical in research where the 
goal is model comparison. Any of the previously mentioned 
methods are effective measures in determining which is the 
appropriate model to use. However, it should be kept in mind 
that it is best to use several measures (Sharda and Wilson 
(1993). 
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F. COKCLUSIOM 


A review of current literature supports the hypothesis 
that artificial neural networks provide forecasting results at 
least as good if not better than results provided by classical 
regression techniques. The network architecture that provides 
the best structure is back-propagation. Researchers in the 
manpower and personnel fields feel that this architecture is 
the best suited for forecasting individual decisions, 
especially for large data sets. 

Since the application of ANNs is relatively new, no 
"standard" technique or rule exists for constructing a neural 
network. Little of the literature addresses the different 
aspects of network design. Researchers fail to comment on the 
number of hidden layers, the number of processing elements 
within each hidden layer, the learning rules, the learning 
rates, and the transfer functions of their networks. 

Based on the literature review, this thesis builds two 
different models and compares the results of each to the 
results of a logit model. When developing the neural network 
models, the research employs a two-step approach. 

During the first phase, a network with one hidden layer 
and thirty processing elements in the hidden layer is 
constructed. In this phase, the different possibilities for 
learning rule, learning rate, epoch size, and transfer 
function are adjusted to obtain peak performance. 
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j^liiticisciuskas (1991) providfis excellsnt infoirination on ths 
choice of learning rules, learning rates, epoch sizes, and 
transfer functions. Each of these facets of a neural network 
are optimized. The optimization process is described in later 
chapters. 

Phase two changes the network topology while holding the 
other aspects of the network constant. A hidden layer is 
added and the amount of hidden neurons is adjusted. The first 
hidden layer contains twenty neurons and the second hidden 
layer contains ten hidden neurons. 

The comparison stage of the research employs several of 
the success measures previously discussed. Each of the three 
models are evaluated using three criteria: the number of true 
positive, the number of false positive, and the simulated R^ 
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III. BACKGROUND ON THE VSI AND SGB PROGRAMS 


A. INTRODUCTION 

When the draft ended in 1973, military manpower policy 
makers faced the problem of meeting force manpower needs 
without the assistance of a draft. Policy was developed and 
implemented to attract and retain high quality personnel. To 
ensure these policies achieved the required goals, manpower 
analysts conducted numerous studies analyzing retention and 
turnover behavior. 

Present day requirements necessitate a smaller military 
force for two reasons. The first reason is the reduced threat 
brought on by the end of the cold war. Since the collapse of 
the Soviet Union in 1991, a constant topic of debate centers 
on the proper size of the United States Military. 

Increased public concern over government spending is the 
other reason. The large federal budget deficit and resulting 
debt have brought all federal programs under scrutiny for 
possible reductions in cost. 

Currently, two programs, the Voluntary Separation 
Incentive (VSI) and the Special Separation Bonus (SSB), are 
being offered to entice career military service members to 
leave the service. This section of the thesis provides 
background information on these two programs, the theoretical 
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framework of occupational choice, and an examination and 
critique of a classical regression forecasting model built to 
predict the "take-rates" of these two programs among United 
States Marine Corps Enlisted personnel. 


B. BACKGROUND 

The 1992 Department of Defense (DoD) Authorization Act 


provides the military services with the VSI and SSB programs. 
The Marine Corps views these programs as positive force 
management tools which assist in shaping the career force 
structure. Essentially, these programs provide monetary 
incentives to military service members who chose to leave the 
service. This section provides a detailed description of each 
program, program eligibility requirements, and a description 
of the program implementation process used by the Marine Corps 
[Ref. 19:p. 1]. 

1. Voluntary Separation Incentive Program Description 

The VSI program is an annuity payment made over 
several years. The payment continues for a period of time 
equal to twice the number of years the service member has 
served on active duty. The amount of the payment is 
calculated using the following formula; 


ANWUALi = (*023) A rlOnu4iv.> *. / a 

PAYMENT (Years of Active Service) 

Fractions of years of service are calculated as they are for 
retirement purposes. Table 1 presents four of the possible 
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VSI payments a Marine Corps Enlisted Member could receive. 
[Ref. 19:p. 2] 

TABLE 1. 


VSI PAYMENTS FOR SELECTED MARINE ENLISTED PERSONNEL 


RANK 

YEARS OF 
SERVICE 

BASE PAY 

VSI PAYMENT 

NUM OF 
YRS 

E-4 

6 

$1,082.10 

$1,974.78 

12 

E-5 

10 

$1,426.50 

$4,279.50 

20 

E~6 

12 

$1,706.70 

$6,144.12 

24 

E-6 

14 

$1,760.10 

$7,392.42 

28 

E-7 

16 

$2,022.90 

$9,709.92 

32 

Source: 

Author 





Footnote: These payments are for FY 93 separations and 

represent before tax amounts. 


Service members who chose VSI incur an obligation to 
serve in the ready reserve. The reserve commitment lasts as 
long as the service member continues to receive VSI annuity 
payments. If at any time the member is separated from the 
ready reserve, VSI payments stop. However, VSI payments 
continue if either of the following conditions apply: 

1) The member dies (payments will continue to the 
member's laeneficiaries) ; or 

2) The member becomes ineligible for continued 
service in the ready reserve for any reason that is not under 
his control (i.e., age, promotion failure, medical, etc.). 
[Ref. 19;p. 1] 
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2. Special Separation Benefit Program Description 

SSB differs from VSI mainly in that SSB is a lump sum 
payment given upon transfer from the active force to the ready 
reserve. The lump sum is computed using the following 
formula: 

PAYMENT = (.15) X (Final Monthly Base Pay) X (12) X 
(Years of Active Service) 

Fractions of years of service are computed in the same way as 
for a member who receives non-disability separation pay. 
Table 2 presents five of the possible SSB payments a Marine 
Corps Enlisted Member could receive. [Ref. 19:p. 3] 


TABLE 2. 

SSB PAYMENTS FOR SELECTED MARINE ENLISTED PERSONNEL 


RANK 

YEARS OF 
SERVICE 

BASE PAY 

SSB PAYMENT 

E-4 

6 

$1,082.10 

$11,848.68 

E-5 

10 

$1,426.50 

$25,677.00 

E-6 

12 

$1,706,70 

$36,864.72 

E-6 

14 

$1,760.10 

$44,354.52 

E-7 

16 

$2,022.90 

$58,259.52 


Source: Author 

Footnote: These payments are for FY 93 separations and 

represent before tax amounts. 


Members who elect separation under SSB incur a three 
year obligation to serve in the ready reserve. [Ref. 19;p. 3] 











3. Separations Benefits 


The transition benefits for separating members changed 

as the programs were implemented. During fiscal year 1992, 

members separated under VSI were entitled to the same 

transition benefits as those members who voluntarily left the 

service, but members who left the service under the SSB 

program received the same benefits as members who were 

involuntarily separated from the service. The important point 

here is that the non-pecuniary benefits associated with each 

program were different in fiscal year 1992. For example, a 

service member choosing SSB is entitled to two years of 

coiTunissary and exchange privileges after his discharge. An 

individual selecting VSI does not receive any commissary or 

exchange privileges. [Ref. 13:pp. 2-3] 

In fiscal year 1993 this changed. The 1993 DoD 

Authorization Act standardized the non-pecuniary benefits 

members received when voluntarily leaving the service under 

the VSI and SSB programs. [Ref 20:pp. 2-4] 

4. Program Eligibility Requirements 

If a member decides to separate from the service, he 

can chose either the VSI or SSB program. The two programs are 

designed to be offered simultaneously. In order to qualify 

for the programs, Congressional Acts require that service 

members meet the following criteria: 

1) Serves on active duty for more than six years as of 
5 Dec. 1991; 
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2) Completes their initial term of enlistment; 

3) Serves at least five years of continuous active 
duty immediately preceding the date of separation; 

4) Are not eligible for retirement; and 

5) Are a regular officer or enlisted, or a reserve 
officer on the active duty list. [Ref. 21:p. 3] 

In addition to these basic requirements, each service 

could further delineate eligibility requirements. The Marine 

Corps chose to further limit participation in the program. 

Marines in the following categories are ineligible for the 

bonuses: 

1) Those who have received a re-enlistment bonus in 
fiscal years 1991 or 1992; 

2) Those who are pending administrative action that 
could result in a involuntary discharge (i.e., 
court-martial); 

3) Those who have been previously denied re-enlistment 
during fiscal year 1992; 

4) Those who require a waiver for re-enlistment; and 

5) Those who have previously refused to extend or re¬ 
enlist to avoid accepting orders to a new duty 
station. [Ref. 19:p. 3] 

5. Marine Corps Program Implementation 

Implementation of the separation programs by the 

Marine Corps is fairly straightforward. The programs are 

offered in three phases. Both officers and enlisted personnel 

are eligible for each phase of the program. However, since 

the focus of this thesis is on enlisted personnel, only the. 

implementation plan for enlisted personnel is reviewed. 
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Phase I for both fiscal year 1992 and 1993 targets 
Marine enlisted personnel with military occupational 
specialties that are no longer needed by the Marine Corps. 
These specialties are for equipment that the Marine Corps no 
longer possesses in its inventory. For example, a Marine with 
a military occupational specialty that deals with operating or 
maintaining the F-4 aircraft is ineligible for the program, 
because the Marine Corps has phased out the F-4 aircraft and 
has replaced it with the newly acquired F-18. [Ref. 22:p. 2] 

Additionally, during Phase I of fiscal year 1993, the 
programs are opened to enlisted members who are not selected 
for promotion to E-7 or E-8 but are in the promotion zone. 
Also, E-5s and E-6s who have more than eleven or fifteen years 
of service, respectively, are eligible for the programs, 

[Ref. 23:p. 3] 

Phase II for both fiscal years is designed to elicit 
volunteers from miliary occupational specialties that are over 
staffed. The programs levy specific grade and years of 
service requirements. Enlisted personnel in the ranks of E-4 
through E-7 are eligible. For example. Marines with the MOS 
0151 (Administrative Clerk) who are E-5s and have more than 
eleven years of service arc eligible to apply for the 
separations incentives. The goal is to reduce overages in 
these skills in order to put promotions back in line with the 
rest of the enlisted force. [Ref, 22:pp, 2-3] 
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Phase III for fiscal year 1992 targets E-6s and E-7s 
in crowded military occupational specialties. For example, E" 
6s and E-7S who possess the MOS 0369 (Infantryman) can apply 
for the programs. Again, the goal is to shape the force and 
relieve the overcrowding pressure that has led to a slow-down 
in promotions for several different communities. During 
fiscal year 1993, Phase III is only open to officers. 

[Ref. 20:p. 2] 

C. THEORETICAL FRAMEWORK 

The decision process a service member goes through to 
determine if he/she will stay in or leave the military is 
unique to the individual and involves many non-observable 
factors. Individual preferences for military or civilian life 
are difficult to measure and weigh heavily in the decision to 
stay or leave the military. The economic theory that best 
fits this decision is the theory of occupation choice. 

This theory states that an individual will select the 
occupation that provides the largest expected lifetime 
utility. This utility includes both pecuniary and non- 
pecuniary aspects. Occupations with higher levels of 
compensation attract more individuals than those with low 
compensation (holding all else constant). However, other 
factors contribute to the decision about which occupation to 
choose. These factors include individual preferences for 
occupational tasks and working conditions (i.e., risk. 
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cleanliness, management philosophy, family separation time, 
etc.)* [Ref. 24:p. 258] 

If an individual decides the military is the occupation 
best suited to maximize his utility, he has examined not only 
the actual monetary compensation he can receive, but the non¬ 
monetary aspects of military life as well. Intangible factors 
such as the individual's preference for military order and 
discipline, family separations, patriotic feelings, the stigma 
(positive or negative) associated with military service, and 
other elements contribute to the occupational choice decision. 

The population targets for the VSI and SSB programs are 
service members who have finished their initial obligations at 
a minimum and have chosen to remain in the service. They are 
experienced with the military lifestyle. Since they have 
chosen to remain in the military, it is evident that this 
lifestyle is preferred over the life style they perceive they 
would live if they leave the military for another occupation. 

Most military personnel have limited experience with the 
civilian labor market but have family and friends who are in 
the civilian labor force. These contacts provide information 
useful in determining if the service member's preferences are 
better suited in a civilian occupation. Although these 
contacts do not provide perfect information, they do provide 
some information to the individual. The decision to remain in 
the military comes about because the individual believes his 
utility is maximized in the military environment. 
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Previous studies support the theory that the potential 
lifetime earnings of an individual are based upon three 
components; observable ability indicators (education, 
experience, mental aptitude, etc.), socioeconomic factors 
(race, gender, marital status, etc.), and non—observable 
components of ability (assertiveness, loyalty, responsibility, 
etc.). These studies include Tinney (1991), Bsrndt (1988), 
and Cymrot (1987). 

Tinney (1991) develops a mathematical equation that 
summari 2 es the occupational choice theory. He states that the 
potential lifetime earnings E of an individual i in occupation 
j is a function of observable ability indicators OA, 
demographic characteristics D, and non-observable ability 
indicators NA. This relationship is represented by: 

Eij = f(OAi, Di, NAi) . (1) 

Tinney further states that the present value V to an 
individual i in occupation j is a function of the present 
value of lifetime earnings Ey, observable individual 
preferences and family background effects OPi, and non¬ 
observable individual preferences and family background 
effects NP;. This may be represented by: 

Vy = g(Ey, OPi, NPi) . (2) 

The theory of occupational choice states that individual 
i chooses occupation j when V is maximized. [Ref. 25] 
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Noblit (1993) applies this concept to the stay or leave 
decision facing members of the Marine Corps and develops the 
following mathematical equations: 


Via, 

= g(Ei„,, 

OPi, 

UP;) 

(3) 

Vie 

= g(Eie/ 

OPi, 

UPj) . 

(4) 


where the subscript "m" indicates the occupation of being in 
the Marine Corps and the subscript "c" represents a civilian 
occupation. [Ref. 21:pp. 23-45] 

Based on these equations. Noblit stated that if 
then the individual remains in the service. If the opposite 
is true, and > V;,,,, then the individual separates from the 
service [Ref. 21:pp. 23-45]. Therefore, in order for an 
individual to take either of the separation bonuses, the 
present value of the dollar amount of the bonus Bj plus the 
present value of the non-monetary separations benefits SBj must 
be at least equal to the difference between the present values 
of a job in the Marine Corps and a job in the civilian sector, 
or: 

B. + SB, > V,„ - Vj, (5) 

Equation (5) must be satisfied for an individual to take 
either of the separation bonuses. 

As a member continues his active duty service and is 
closer to reaching the minimum retirement criteria, the 






present value of military retirement increases. At the same 
time, the number of years to accumulate a civilian retirement 
decreases. Additionally, an individual becomes less likely to 
leave the service the longer he remains because he grows more 
accustomed to the military lifestyle and less likely to 
undertake a career change. 

These two factors cause a widening of the gap between the 
present value of a military occupation and the present value 
of a civilian occupation as the length of service increases. 
If the goal of the services is to attract senior members to 
voluntarily leave the service, the present value of the 
separation benefits (both pecuniary and non-pecuniary) must be 
a positive function of years of service. Since the non- 
pecuniary aspects of each program are constant among service 
members, the monetary value of each separation incentive 
program must increase with the individual's years of service. 

D. THE NOBLIT MODEL 

In building his model to predict the "take-rates" of the 
VSI and SSB programs, Noblit reviews current literature on the 
effects of the Selective Reenlistment Bonuses, voluntary 
termination from the service, reenlistment behavior of 
careerists, and retention behavior. These studies laid the 
foundation for his model, [Ref. 21:pp. 11-34] This section 
provides a description of the model and a brief discussioxi of 
the variable choice„ 
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1. Functional Form 

The decision to take either of the separation bonuses 
is, in essence, a stay or leave decision by the individual. 
Therefore, the dependent variable of Noblit's model is 
dichotomous. When modeling a dichotomous dependent variable, 
researchers may choose a model from several options with 
different functional forms. [Ref. 26:pp. 510-529] Noblit's 
model is a binomial LOGIT which uses the maximum likelihood 
estimation technique. 

The binomial LOGIT model has the following functional 

form: 

p. =-i- 

Pj is the probability that individual i has the characteristic 
defined by P, * 1. The "betas” are the parameter estimates. 
X, represents the vector of explanatory variables. The epsilon 
is the stochastic error term associated with the model. 

2. Explanatory Variable Selection 

The theoretical model of occupational choice states 
that an individual chooses the occupation that maximizes his 
lifetime utility. In choosing between a military occupation 
and a civilian one, an individual examines the present value 
of military compensation, the present value of civilian 
compensation, and the present value of non-pecuniary factors 
that determine the individual's taste for military service or 
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civilian service. Noblit's model develops proxy variables for 
each of these elements that contribute to the stay or leave 
decision. 

a. Present Value Of Military Compensation 

Developing proxy variables for the present value of 
military compensation is the easiest element of the stay or 
leave decision to model. Military compensation is based upon 
three categories. First, there are the entitlements that all 
Marines are eligible for: base pay, housing allowances and 
subsistence allowance. Base pay is determined by the 
individual's rank and years of service. Housing allowances 
and subsistence allowances are computed based upon rank and 
marital status. 

Second, total military compensation includes 
reenlistment bonuses paid to enlisted service members. These 
bonuses are a function of reenlistment contract, rank, base 
pay, and the current stock of personnel relative to 
requirements in similar military occupational specialties. 
Not all enlisted members are entitled to or receive re¬ 
enlistment bonuses. 

The last element of military compensation centers 
around proficiency pay. Marines receive extra pay if they 
possess certain skills. These skills include airborne or 
scuba qualified, flight crew member, explosive ordnance 
disposal, and foreign language capable. Marines serving as 












embassy guards, recruiters, and drill instructors receive 
proficiency pay as well. 

Noblit chose four explanatory variables to act as 
proxies for the present value of military compensation. These 
variables represented the individual's rank, marital status, 
and whether or not the individual received either a 
reenlistment bonus or proficiency pay. [Ref. 21;p. 40] 

Each of these variables are factors which determine 


the total monetary compensation an individual receives for 
active military service. A service member can calculate the 
present value of his expected military compensation because 
this compensation is fixed by law and is based upon rank, 
years of service, marital status, job proficiency, and 
reenlistment bonuses. If the individual assumes he receives 
promotions and longevity raises at the same rate as his 
predecessors, then calculating the present value of his 
military pecuniary benefits is straightforward. 

An interesting point hex'e is that Noblit chose to 
leave years of service out of his model. This was done mainly 
because of the high correlation between rank and years of 
service. The military is a hierarchical organization and 
members, with few exceptioiis, reach the upper grades only 
after serving for a particular amount of years. 

[Ref. 21:p, 40] 
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Jt>. Present Value Of Civilian Compensation 


Civilian compensation is a function of observable 
ability indicators and socioeconomic factors. Noblit develops 
four proxy variables to represent the present value of 
civilian compensation. These variables represent the 
education level, gender, race, and military occupation 
specialty of the individual. The large number of different 
occupational specialties within the military necessitated the 
grouping of individuals into four similar occupational 
categories. The groupings were; 1) Admin/Supply, 2) Combat, 
3) Combat Service Support (non-technical), and 4) Combat 
Service Support (Technical). [Ref. 2l:pp. 41-43] 
c. Non-Pecuniary Job Choice Factors 

Noblit creates six proxy variables to represent the 
individual's satisfaction with miliary life. These variables 
represent promotion rates, type of duty, and job satisfaction. 

Two explanatory variables represent promotion 
rates. The first variable is the length of time the 
individual Las spent in his present grade. The other 
explanatory variable is a promotion index. This promotion 
index indicates if the individual is being promoted at a 
faster or slower rate than his peers. It is calculated by 
dividing the individual's time-in-service at his last 
promotion by the average time-in-service at the last promotion 
for Marines in the same grade and occupational specialty. If 


53 









the index is greater than one, the individual is being 
promoted more slowly than his peers. If it is less than one, 
the opposite is true. [Ref. 21;pp. 43-44] 

Two explanatory variables represent type of duty. 
Marines are classified into three categories of duty: Atlantic 
Operating Forces, Pacific Operating Forces, and supporting 
forces. Noblit develops one variable to indicate the 

assignment of Marines to units in the Atlantic Operating 
Forces. Another variable represents Marines who are not 
assigned to the operating forces but to the supporting 
establishments. Marines in the Pacific Operating Forces are 
treated as the omitted condition in the model. [Ref. 21sp. 44] 

Two explanatory variables represent job 
satisfaction. One variable indicates whether or not the 
individual is in his primary military occupational specialty. 
The other measures how much time remains on the individual's 
enlistment contract. This is a good proxy variable for job 
security. Those close to their end of active service may fear 
they can not re-enlist thus causing job dissatisfaction. 

[Ref. 2l:p. 45] 

Table 3 contains a list of the types of variables 
Noblit uses in his model. 

E. CRITIQUE OF THE NOBLIT MODEL 

Noblit's model is well specified and built upon a strong 
theoretical foundation. Each proxy variable represents 
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factors that, in theory, are important in deciding to stay or 
leave the military. Noblit chose the proper functional form 
for the model. The only area for improvement is the choice of 
proxy variables. The present value of military compensation 
is well represented. However, both the present value of 
civilian compensation and the non-pecuniary aspects of 
military life could be represented more thoroughly with the 
addition of several other proxy variables. 

TABLE 3. 

EXPLANATORY VARIABLES USED TO BY NOBLIT PREDICT THE 
"TAKE-RATES" OF THE VSI AND SSB PROGRAMS AMONG 
M^lRINE corps enlisted PERSONNEL 


Variable Description Proxy For; 


Rank 

Mil. 

Compensation 

Proficiency Pay 

Mil. 

Compensation 

Marital Status 

Mil. 

Compensation 

Bonus 

Mil. 

Compensation 

Educational Level 

Civ. 

Compensation 

Gender 

Civ. 

Compensation 

Race 

Civ. 

Compensation 

Mil. Occ. Specialty 

Civ. 

Compensation 

Time In Grade 

Non- 

•Pecuniary Factor 

Promotion Index 

Non- 

■Pecuniary Factor 

Work In Primary MOS 

Non- 

•Pecuniary Factor 

Job Location 

Non- 

•Pecuniary Factor 

Time Left On Contract 

Non- 

•Pecuniary Factor 


Source; Author," 
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In addition to the proxies Noblit specified to predict 
civilian earnings, there are three other possibilities. 
First, the mental ability of an individual has an impact on 
his potential to earn more money in the civilian sector. The 
GCT score of the individual can be used as a proxy variable 
for the individual's mental capability. 

Age is another factor that impacts civilian earnings. Age 
tends to act as a proxy for the amount of experience 
individuals possess and their level of maturity. Civilian 
earnings is a function of work experience. Work experience is 
a function of age. It is understood that military and 
civilian experience differ in some respects. An individual 
may not be able to apply all of his military skills in a 
civilian job, but some of the skills developed while in the 
Armed Forces are transferable. 

The last possibilities of proxies for the present value of 
civilian earnings potential centers around the additional 
occupational specialties a Marine has been assigned. 
Assignment to jobs such as recruiting duty, embassy duty, 
instructor duty, and joint duty are highly competitive. Only 
the best and the brightest receive these assignments. Proxy 
variables designed to capture this information may reveal 
information about the non-observable capabilities of the 
individual that would have an impact upon civilian earnings. 

To assess the present value of non-pecuniary aspects of 
military life, two additional variables should be added to 
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Noblit's model. The Marine Corps is highly physical and 
places great demands on individual to maintain certain 
standards. These standards include both weight and athletic 
ability. Individuals who are not athletically inclined or 
tend to gain weight more easily than others may put forth more 
effort to maintain the standards than the average Marine. 
This extra effort may not result in any extra reward for the 
individual which in turn may lead to higher dissatisfaction 
with the service. The individual's personal fitness test 
score should be added to the model as proxy variables to 
account for this potential source of job dissatisfaction. 

The other variable needed in Noblit's model centers around 
the amount of time an individual spends away from his family 
in a deployed status. Marines deployed for long periods of 
time tend to show greater dissatisfaction for military life. 
The amount of time an individual has spent deployed will act 
as an excellent proxy for this negative non-pecuniary aspect 
of life in the Marine Corps. 

Noblit's model forms an excellent starting point. 
However, the addition of five explanatory variables captures 
more of the aspects that make up the decision to stay or leave 
the military. The five variables are contained in Table 4. 
The cost of adding these five variables is relatively low. 
The variables Noblit used as well as the five, additions are 
all available through the Marine Corps Enlisted Master File. 
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TABLE 4 


ADDITIONAL EXPLANATORY VARIABLES USED TO PREDICT THE 
"TAKE-RATES" OF THE VSI AND SSB PROGRAMS AMONG 
MARINE CORPS ENLISTED PERSONNEL 


Variable Description 

Proxy For: 


GCT Score 

Civ. Compensation 


Age 

Civ. Compensation 


Additional MOS Type 

Civ. Compensation 


Deployed Time 

Non-Pecuniary P'actor 


Physical Fitness Test Score 

Non-Pecuniary Factor 



Source: Author. 


F. CONCLUSION 

The theory of occupational choice sets the framework for 
the model Noblit built to predict the "take~rates" of the VSI 
and SSB programs for Marine Corps enlisted personnel. The 
model is appropriate for the data and the explanatory 
variable,^ are excellent proxies for use in modeling the stay 
or leave decision. Five additional variables are suggested to 
capture more of the factors that weigh in the decision. 
Utilization of the model will allow manpower planners to 
implement the separations incentive programs with more 
efficiency, 
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IV. DATA AMD METHODOLOGY 


A. INTRODUCTION 

One method of determining the predictive power of 
artificial neural networks is to compare the forecasting 
capabilities of an artificial neural network with some other 
established prediction tool. Since military manpower and 
personnel planners use classical regression techniques in 
forecasting the effects of policy changes, this research 
compares the forecasting abilities of neural networks with 
classical regression techniques. 

This comparison involves four steps. First, an 
appropriate data set on the behavior of interest is obtained. 
The second step is to develop a classical regression model 
based upon previous research with the aid of a statistical 
software package, SAS . The third step is to build two neural, 
network models using a commercially available software 
product, NeuralWorks Professional II/Plus . The fourth, and 
last step, is to compare the results of each of the neural 
network models to the regression model. 

This chapter of the thesis outlines the data set and 
variables used to develop all three models and then provides 
a description of each model's structure. 









B. DATA 


The research utilizes a data set obtained from the Master 
Files of Enlisted Personnel maintained at Headquarters, Marine 
Corps, Washington, D. C. It contains the population of 
eligible enlisted personnel who are qualified to leave the 
Marine Corps under the VSI and SSB programs during fiscal year 
1993. The data set was developed by manpower planners within 
the Enlisted Force Planning section of the Manpower Management 
Division. 

The original data set contains 19,212 observations. After 
deleting observations with missing values and removing 
observations that contain values which make little logical 
sense, there are 17,535 observations. Further restrictions 
placed on the data result in a final subset of males who meet 
the follov/ing criteria: 

1) are an E-5 or an E-6; 

2) have between eight years of service (YOS) and 
fourteen years of service, inclusive; 

3) are at least 26 years old but not older than 34 
years old; and 

4) have at least a high school education. 

The resulting data set contains 8,042 observations. Table 5 
provides a summary of the deletions performed on the 
population of eligible enlisted members who qualify for the 
VSI and SSB programs during fiscal year 1993. 

The primary purpose behind restricting the data is to 
obtain an appropriately homogeneous data set for analyzing VSI 
and SSB program subscribers. Restricting the data to the 
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TABLE 5 


DELETIOKS FROM THE FOFULATION OF HARIME ENLISTED PERSONNEL 
ELIGIBLE TO SEPARATE FROM THE MARINE CORPS 
UNDER THE VS I AND SSB PROGR2VM 


Reason for Deletion 

# of Observations 


Meeting 

Criteria 

Initial Population Size; 


19,212 

Missing/Illogical variable values: 

1,677 


Female: 

961 


E-4 

287 


E-7 

3,905 


YOS < 8 

727 


YOS > 14 

6,692 


AGE < 24 

6 


AGE >36 

2,417 


Final Data Set Size ; 

—— 1 ■ 1 ■ 


8,042 


Footnote; Numbers do not sum to total due to possibility of an 
observation meeting more than one criteria for deletion (i.e., 
female who is an E-4). 

Source; Author 


above criteria ensures the development of a theoretically 
sound classical regression model. During the research, 
separate linear probability models are estimated on 
appropriate subsets divided by gender, rank, and years of 
service. A chow test is performed on each model to determine 
coefficient stability between models. The resulting "F’' 
statistic supports the conclusion that separate models are 
required to estimate the impacts of the explanatory variables 
for each of the subsets. 
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The data are divided into three subsets. The first subset 
is labeled the "test" data subset and its purpose is twofold. 
First, it is used to validate each of the trained models. 
Second, it provides data which has not been seen by any of the 
models and is used to generate test statistics that are used 
as comparison metrics. The test data subset is created with 
the assistance of the random number generator available in the 
SAS software package. The end result is a randomly selected 
test data subset which contains approximately ten percent of 
the observations from the original data. This amounts to 819 
observations of which 80 (9.8 percent) are takers of the VSI 
and SSB programs. 

The other two subsets are "training" data subsets and 
contain observations that train all of the models. The term 
train means to estimate the explanatory variables' 
coefficients in the case of the classical regression model and 
to determine the weights of each processing element in the ANN 
models. The classical regression model's training data subset 
amounts to every observation not placed in the test data 
subset; 7,223 observations, of which 654 (9.1 percent) are 
takers of the VSI and SSB programs. 

When training an ANN, it is important that the training 
data set possess a relatively equal quantity of all possible 
outcomes. In the full data set, the ratio of takers to non¬ 
takers of the VSI and SSB programs is approximately nine to 
one. To ensure the network learns relationships and not 
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probability distributions, the training data subset used by 
the ANN models contains 1,400 observations representing 654 
takers and 746 non-takers. Observations are chosen from the 
ANN training set in a random method similar to the one used 
for the test data subset. 

In his research. Noblit used the entire population of 
Marine Enlisted personnel who qualified for separation from 
the Marine Corps under the VSI and SSB programs during fiscal 
year 1992. The data for this research differs slightly for 
two reasons. First, the data set is restricted for the 
structural homogeneity purposes discussed above thus producing 
a subset and not the population. The second difference 
concerns the difference in fiscal years. Noblit used fiscal 
year 1992 data while this research uses fiscal year 1993 data, 

C. VARIABLE DESCRIPTION 

This section is divided into two parts. The first section 
lists those variables used in constructing the regression 
model. The second section lists those variables used in 
building the neural network models. Every variable used in 
the classical regression model is used in the ANN models. 
However, the ANN models contain several additional variables. 

When constructing classical regression models, researchers 
must use great care in selecting explanatory variables. 
Researchers choose the "best" variables which act as proxies 
for certain characteristics. Causal modelers must be cautious 
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that little multicollinearity exists between the explanatory 
variables. This is not the case with ANN models. ANN models 
have been postulated to possess the capability to handle 
multiple measures of the same effect and account for the 
inter-relationships between explanatory variables [Ref. 7]. 
Therefore, ANN model developers often present all available 
variables to the ANN even if there exists substantial 
multicollinearity between them. The usefulness of the 
resultant ANN models for policy purposes is discussed in 
Chapter VII. 

The explanatory variables are classified into three groups 
of proxy variables that represent: 1) the potential value of 
military earnings, 2) the potential value of civilian 
earnings, and 3) the non-pecuniary aspects of military life 
versus civilian life, 

1. Explanatory Variables Used In The Classical Regression 
Model 

The variables in this model are chosen based upon 
previous research conducted by Noblit (1993). Appendix A 
contains a comparison of the variables used by Noblit and the 
variables used in this research. 

a. Military Compensation 

This research uses four' variables as proxies for 
railitary compensation. They represent rank and marital 
status. Noblit specified his model to include these, plus 
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proxy variables to represent proficiency pay and re-enlistment 
bonuses. This research did not use variables to represent 
proficiency pay or re-enlistment bonuses because the data did 
not contain accurate information for these conditions. Table 
6 contains a summary of each proxy variable and its expected 
sign. 

TABLE 6. 

PROXY VARIABLES OP MILITARY COMPENSATION FOR 
CLASSICAL REGRESSION MODEL 


Variable Variable description Expect 

Name Sign 


SSGT = 1 if obs is an E-6; 0 otherwise 

SINGLE = 1 if obs is not married; 0 otherwise + 

SINWDEP = 1 if obs is single and has dependents + 

MARRIED = 1 if obs is married w/o children + 

MARWCHLD - 1 if obs is married and has children NA* 


* This IS treated as the base case 
Source; Author 


(1) Rank, The variable which represents rank, 
SSGT, is set equal to one if the observation is an E-6 and is 
set equal to zero otherwise. The hypothesized sign is 
negative because an individual who is an E-6 earns more money 
than an E-5 and, therefore, would be less likely to leave the 
service. 
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(2) Marital Status. Marital status is represented 
by four dummy variables, SINGLE, SINWDEP (single with 
dependents) , Ml^RIED (no children) , and MARWCHLD (married with 
children). The variable corresponding to the observation's 
status is given the value of one while the others are coded as 
zero. Married military members with children are expected to 
be less likely to undertake an occupational change because of 
their family responsibilities. Therefore, it is hypothesized 
that the variables SINGLE, SINWDEP, and M2UIRIED will have 
positive coefficients when compared to the base case of being 
married and having children (MARWCHLD = 1). 
b. Civilian Compensation 

Thirteen variables act as proxies for the potential 
earnings of an individual in the civilian job market. They 
represent race, military job, education level, intelligence 
level, and experience. Noblit's model used all of these 
factors except the last three. Table 7 contains a summary of 
each variable and its expected sign. 

(1) Race. Race is represented by three dummy 
variables, WHITE, BLACK, and OTHMIN (other minorities). The 
variable corresponding to the observation's racial group is 
given the value of one and the other variables receive the 
value of zero. Previous studies indicate Caucasians face 
better civilian employment opportunities. Therefore, the 
expected sign of the coefficients of the variables BLACK and 
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OTHMIN should be negative when the variable WHITE acts as the 


base case. 

(2) Military Occupational Specialty. An 
observation's military occupational specialty (MOS) is 
represented by four dummy variables, FIGHTER, ADMIN, NONTECH 
(non-technical MOS), and CSSTECH (technical MOS). individuals 
who have technical MOSs receive more formalized training which 


TABLE 7. 

PROXY VARIABLES OP CIVILIAN COMPENSATION FOR 
CLASSICAL REGRESSION MODEL 


Variable Variable description Expect 

Name _Sign 


BLACK 

= 

1 

if 

obs 

is Afro-American; 0 otherwise 


OTHMIN 

= 

1 

if 

obs 

is hispanic, asian or other 

- 

WHITE 

BS 

1 

if 

obs 

is a Caucasian; 0 otherwise 

NA* 

FIGHTER 

= 

1 

if 

obs 

has a combat MOS; 0 otherwise 

- 

ADMIN 

= 

1 

if 

the 

obs is an administrator 

- 

NONTECH 

= 

1 

if 

the 

obs has a non-technical MOS 

- 

CSSTECH 


1 

if 

the 

obs has a technical MOS 

NA’ 

YR3C0LL 


1 

if 

obs 

has 3 or more yrs of college 

+ 

SOMCOLL 

= 

1 

if 

obs 

has 2 or less yrs of college 

+ 

HISCHGRD 


1 

if 

obs 

is a high school graduate 

NA* 

INTEL 

Continuous 

value equal to GCT score 

+ 

AAGE 

Continuous 

value equal to obs age 

+ 

BONUSMOS 

= 1 if obs 
instructor 

is a recruiter/drill 

•• 


















is easily transferred to the civilian marketplace. These 
individuals can make a transition to the civilian sector more 
smoothly and have greater opportunities in a civilian market 
than those with other MOS types. Therefore, it is hypothesized 
that the signs of the coefficients for the variables FIGHTER, 
AOMIN, and NOKTECH will be negative when the variable CSSXECH 
acts as the base case. Appendix B contains a list of MOSs and 
their occupational categories. 

(3) Education Level. Education level is 
represented by three dummy variables, HISCHGRD (high school 
graduate), SOMCOLL (two or less years of college), and YR3C0LL 
(three or more yeans of college). Previous studies provide 
®''^id®bce that individuals with higher education levels receive 
more compensation than those with limited formal schooling. 
Military members with more formal education have better 
opportunities in the civilian marketplace than those with less 
education. Therefore, it is hypothesized that if the variable 
HISCHGRD acts as the base case, the variables SOMCOLL and 
YR3C0LL will have positive signs. 

(4) Intelligence Level. The variable INTEL is the 
value of the observation's GCT score. This score is a 
composite of several of the separate ASVAB scores. It acts as 
a proxy for an individual's mental ability. Individuals who 
have higher mental abilities face better civilian 
opportunities and, therefore, would tend to make a transition 







to civilian life at a higher rate than those with lower mental 
ability. The hypothesized sign for this variable is positive. 

(5) Age. The variable AAGE is the observation's 
age. This acts as a proxy variable for the individual's 
experience level. Older individuals have a broader experience 
base and greater opportunities to receive additional training, 
both general and specific. Previous studies indicate that 
individuals with more experience receive higher compensation. 
Thus older individuals within our data (in the range 26 to 34 
years old) have better opportunities in the civilian market 
place than the younger individuals. Therefore, the 
hypothesized sign is positive. 

(6) Bonus Skills. The variable BONUSMOS is a dummy 
variable assigned the value of one if the observation is a 
recruiter or a drill instructor. The selection process for 
these two occupations is extremely competitive. If a Marine 
is chosen to hold either of these jobs, the Marine Corps has 
provided the individual with a message implying that he fits 
well in the organization. This increases the individual's 
confidence that he possesses the skills and abilities needed 
to complete a military career successfully. Therefore, a 
negative sign for this variable is hypothesized. A drill 
instructor or recruiter is anticipated to be less likely to 
leave the Marine Corps under these programs. 
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c. Non-Pecuniary Aspects of Militaxy Life 

Ten variables are used to capture non-pecuniary 
aspects of military life. These variables include measures of 
current job satisfaction and job duty type. Noblit's research 
used each of the same measures with one exception, physical 
fitness test class. Table 8 contains a summary of each 
variable and its expected sign. 


TABLE 8. 

PROXY VARIABLES OF NON-PECUNIARY ASPECTS OF MILITARY LIFE 
FOR CLASSICAL REGRESSION MODEL 


Variable 

Name 


Variable description 

Expect 

Sign 

INMOS 

= 1 if 

obs 

works in his primary skill 

- 

SLOWTRK 

= 1 if 

obs 

is promoted slower than peers 

+ 

FASTTRK 

= 1 if 

obs 

is promoted faster than peers 

- 

AVGPROMO 

= 1 if 

obs 

is promoted with his peers 

NA* 

JUNIOR 

= 1 if 

the 

obs has 2 yrs or less TIG 

- 

MOSEAS 

Continuous value equal to time remaining 
on active duty 


PFT_1ST 

= 1 if 

the 

obs scores well on the PFT 

- 

FMFLANT 

- 1 if 

obs 

duty station is East Coast 

+ 

FMFPAC 

= 1 if 

obs 

duty station is West Coast 

+ 

NONFMF 

= 1 if 

obs 

is not in operating force 

NA* 


* These are treateil as the base case. 
Source: Author 


(1) Job Satisfaction Variables. Job satisfaction 
is represented by seven variables which capture information on 
current job, promotability, job security, physical fitness, 
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and reward factors. The first variable, INMOS, is a dummy 
variable and is given the value of one if the observation is 
currently working in his primary MOS. The hypothesis is that 
an individual who is filling a position other than what he is 
trained for, has a lower job satisfaction level and, 
therefore, wants to explore other alternatives including 
leaving the service. The expected sign is negative. An 
individual working in his MOS will be less likely to leave the 
Marine Corps. 

The next three variables are dummy variables and 
capture the rate at which the Marine is being promoted in 
relationship to his peers. The dummy variables are SLOWTRK 
(representing a slower promotion rate) , AVGPROMO (representing 
an average promotion rate), and FASTTRK (representing a faster 
promotion rate). The observation's time-in-service at his 
last promotion is compared to the average for his occupation. 
An index is created by dividing the time-in-service at last 
promotion by the average time-in-service for Marines of the 
same pay grade and occupational category. Those with a high 
index (time-in-nervice at last promotion is greater than the 
average) are on the slow track and those with a low index 
(time-in-service at last promotion is less than the average) 
are on the fast track. The variable AVGPROMO acts as the base 
case. The expected sign is positive for SLOWTRK since low 
promotion opportunities are likely to lead to higher job 
dissatisfaction which results in voluntary separation from the 









service. The expected sign for FASTTRK is negative as fast 
proMiotion rates tend to lead to job satisfaction and 
retention. 

The fifth variable representing job satisfaction, 
JUNIOR, is a dummy variable which is assigned a value of one 
if the observation has two years time--in~grade (TIG) , or less, 
and zero otherwise. This captures the "motivational” aspects 
associated with promotions. The hypothesis is that those who 
have recently been promoted are happy with the extrinsic 
awards they have received and therefore, are less likely to 
separate voluntarily from the service. Thus the expected sign 
is negative. An individual recently promoted is less likely 
to attempt separation under the VSI and SSB programs. 

The sixth variable representing job satisfaction 
captures information on job security. The variable, HOSEAS, 
is a continuous variable which represents the amount of time, 
in months, until the observation's end-of-active-service (EAS) 
date arrives. The hypothesis is that those closer to their 
EAS are considering an occupational change and are more likely 
to separate. The sign of this variable should be negative. 
The larger the variable (further from EAS), the less likely 
the observation will take the program. 

The last variable representing job satisfaction is 
a dummy variable, PFT__1ST, and represents the score on the 
physical fitness test. Twice a year, all Marines are required 
to take a physical fitness test (PFT). The score on the test 
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is categorized into three classes based on the individual's 
score and his age. Those who receive a "first class" are 
normally in top physical condition and have little difficulty 
completing any physical activity required in the execution of 
their duties. Those who fall into the "second" or "third" 
class are in moderate or low physical condition and may 
experience difficulty in completing physical activity 
associated with their duties. The variable PFT_1ST receives 
a value of onv^ if the observation has a first class score and 
a value of zero otherwise. The hypothesis is that individuals 
with a first class score have less job dissatisfaction due to 
their physical abilities and, therefore, are less likely to 
separate. Thus the expected sign is negative. Marines with 
first class PFT score ai*e less likely to separate from the 
service under the VSI and SSB programs. 

(2) Job Duty Type. Job duty type is represented by 
three dummy variables, NONFMF, FMFLANT, and FMFPAC. Operating 
Forces in the Fleet Marine /orce (FMF) experience long 
deployments and busy training cycles. East Coast units in the 
FMF j'ail under the operational, command of the Commander, Fleet 
Marins i'crces, i^tlantic (FMFLANT) . West Coast units in the 
FMF fal’ under the operational command of the Commander, Fleet 
Marine Forces, Pacific (FMFPAC). All other non-operating 
forces are classified as being non-FMF. The variable 
correspo.iding to the obse .'■•‘'ion's status is assigned a value 








of one while the other variables are assigned a value of zero. 
Individuals in the FMF experience frequent deployments and 
intense training cycles unlike those in non-FMF billets who 
experience a more stable job environment. These differences 
can lead to higher job dissatisfaction amongst Marines 
assigned to FMF units. Therefore, the expected sign of the 
variables FMFLANT and FMFPAC are positive. Thus those in the 
FMF tend to separate under the VSI and SSB programs at a 
higher rate than those in the non-FMF. 

2. Explanatory Variables For The Artificial Heural 
Networks 

The ANN models are presented with all variables 
contained in the data. Thus, the ANN models use every 
variable which the classical regression model uses including 
the omitted variables, plus nine others. Of these nine, one 
impacts upon military compensation, five are proxy variables 
for potential civilian compensation, and three measure non- 
pecuniary aspects of military life, 
a. Military Compensation 

The continuous variable, YOS, measures the number 
of years of active service an individual has completed. This 
information directly impacts upon military pay but is not used 
in the classical regression model due t( its high collinearity 
with the other explanatory variables. The hypothesis is that 
as years-of-service increase, the tendency to leave the Marine 
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Corps decreases because both pay and the present value of 
retirement benefits increase. Thus the expected sign for YOS 
is negative. The probability of separation from the service 
decreases as years-of-service increase, 
b. Civilian Compensation 

Five additional variables are measures of the 
individual■’s mental capacity. These are represented in Table 
9. The first variable, AFQTPREC, is a continuous variable 
that reflects the percentile ranking of the individual's Armed 
Forces Qualification Test (AFQT) score. Those individuals 
with a high value for this variable scored better on the AFQT 
than their counterparts. The hypothesis is that those Marines 
in a higher percentile have better civilian opportunities than 
their counterparts and stand a higher probability of 
voluntarily leaving the service. Thus the expected sign is 
positive. 

The last four variables, ASVABRAR, ASVABRMK, 
ASVABRPC, and ASVABRHK, are continuous variables which 
represent the raw scores received on the arithmetic reasoning, 
math knowledge, paragraph comprehension, and word knowledge 
portions of the Atmed Services Vocational Aptitude Battery 
(ASVAB), respectively. The hypothesis is the same as for the 
variable AFQTPREC. Thus the expected sign for these variable 
is positive. 
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c. Non-pecuniary Benefits 


Three additional variables are added to the ANN 
models that represent some aspect of job satisfaction. These 
are represented in Table 10. The first, DCTB_YRS, is a 
continuous variable which reveals the length of time a service 

TABLE 9. 

PROXY VARIABLES OF CIVILIAN COMPENSATION 
FOR NEURAL NETWORK MODELS 


Variable Variable description Expect 

Name Sign 

AFQTPREC Percentile ranking of AFQT score + 

ASVABRAR Arithmetic Reasoning score on ASVAB + 

ASVABRPC Paragraph Comprehension score on ASVAB + 

ASVABRWK Word Knowledge score on ASVAB + 

ASVABRMK Math Knowledge score on ASVAB without + 

_f amily __ 

Source: Author =s===* 


member has been located at his current duty station. The 
longer he remains at one station, the more rooted in the 
community he becomes. His spouse and children, (if he has 
any), become content with their environment and the thought of 
disrupting this environment through a change in duty station 
may lead to increased job dissatisfaction. Thus, the expected 
sign is positive. The longer an individual has been at the 
same duty station, the higher the probability that he 
separates under the VSI and SSB programs. 












The second job satisfaction variable, ACC_DEP, 
represents the amount of time, in months, an individual has 
been deployed away from his duty station. Longer deployment 
times lead to higher job dissatisfaction and a desire to leave 
the service. Thus the expected sign is positive. As ACC_DEP 
gets larger, the probability of voluntarily leaving the 
service increases. 


TABLE 10. 

PROXY VARIABLES OF NON-PECUNIARY ASPECTS OF MILITIOiY LIFE 

FOR NEURAL NETWORK MODEL 


Variable Variable description Expect 

Name Sign 

DCTB__YRS Number of years at same duty station + 

ACC_DEP Number of months spent deployed + 

DAUS_DR1 Number of years since assigned to duty + 

__ station without family _ 

Source: Author ~ 


The last job satisfaction variable, DAUS_DR1, is a 
continuous measure of the length of time since the service 
member has deployed to a duty station where his dependents are 
not authorized. The longer this interval, the higher the 
probability the individual may be assigned to such a duty 
station. This is a good indicator of upcoming family 
separation time. Increased chances of family separation 
generate higher levels of job dissatisfaction. 


Thus the 












expected sign for this variable is positive. The larger the 
value of DAUS_DRl, the higher the probability the Marine will 
separate voluntarily under the VSI and SSB programs. 

3. Conclusion 

There are twenty-two variables used in the classical 
regression model. The ANN models use thirty-seven variables. 
The difference exists for two reasons. First, the classical 
regression model requires omitted conditions when using more 
than one dummy variable to represent a certain condition. For 
example, if race is the condition, an observation can be 
classified as white, black, and other. Only two variables are 
required to classify all possibilities properly. The ANN 
models require dummy variables for every possible category. 
This accounts for six of the additional variables. 

The other nine variables contained in the ANN model 
but not in the classical regression model are additional 
measures of the same effects which are measured by other 
variables contained in both models. Adding them to the 
classical regression model causes severe multi-collinearity 
problems. The "kitchen sink" approach is used to present 
every available variable to the network. This simulates the 
approach of researchers v;ho are unfamiliar with the theory 
behind the behavior they are investigating and are unable to 
determine relationships that exist within the data. 
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D. MODEL SPECIFICATION 

This research builds three models, one classical 
regression model and two ANN models. The classical regression 
model is based upon the theory of occupational choice and is 
an extension of previous research conducted by Noblit (1993). 
The two ANN models differ mainly in their basic topology. 
This section provides a description each network and the 
methods employed to reach solutions. 

1. Classical Regression Model 

The classical regression model is a binomial logistic 
regression model. Logistic regression models are best suited 
to forecasting problems when the dependent variable is 
dichotomous. The dependent variable in this research (for 
every model) is VSISSB which equals one if the Marine 
separated from the Marine Corps under the VSI and SSB programs 
and equals zero otherwise. 

The model has the following function form: 

p. = -i- 

1 + g -Po*Pl^li + p2^2i+ • • • 

The value of Pj equals the probability that individual i will 
separate from the Marine Corps under the VSI and SSB progr.*ms, 
given his individual values for the set of explanatory 
variables (X,, X^, ..., X„) . Bq is the constant term, e, is the 
stochastic error, and Bj, through B„ are the coefficients for 
each respective explanatory variable. 











2. NEURAL NETWORK MODELS 


The two artificial neural netwox-k models are 
constructed using a commercially available software product, 
NeuralWorks Professional II/Plus . This program is chosen for 
two reasons. First, it provides a large degree of versatility 
in constructing an artificial neural network. Every aspect of 
the network may be adjusted to suit the needs of the 
developer. The other reason stems from the support provided 
by the software developers. The documentation provided with 
the software and the technical support available make this 
product an industry leader. 

Each network is similar in design. The main 
differences between the two networks center on network 
topology. One model has one hidden layer with thirty hidden 
neurons while the other has two hidden layers, the first layer 
having twenty hidden neurons and the second layer having ten 
hidden neurons. 

Each network employs the back-propagation learning 
algorithm using the normalized cumulative delta learning rule 
discussed earlier. The network continues training until one 
of two conditions is met. First, the root mean square error 
reaches 0.001 or the network has seen 3,500,000 training 
cases. Continuing on after either of these points is 
unnecessary because a minimal error has been reached or the 
network "memorizes" the training data and can not generalize 
to the test data. 
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E. COMPARISON TECHNIQUES 


In determining which of the three models provides the best 
results, two methods are used. First, each model is run on a 
validation set of data which none of the models have seen 
before. The model which correctly predicts the largest number 
of actual outcomes receives the label as "best" model. 
Additionally, the other criterion used is the simulated 
discussed in Chapter II which was developed by Wiggins and 
Engquist. 

F. CONCLUSION 

In summary, this research attempts to compare the 
forecasting capabilities of artificial neural networks to 
classical regression techniques. This is accomplished in four 
steps. First, a homogeneous data set is obtained and divided 
into subsets to test and train each of the models. Second, a 
classical regression model is developed based upon previous 
research. Third, two artificial neural networks are 
constructed, each with a different topology. Fourth, the 
predictive capabilities of each model is compare using 
previously defined metrics. 
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V. RESULTS OF THE LOGISTIC REGRESSION MODEL 
This chapter presents a descriptive analysis of the data, 
the results of the classical regression model, an 
interpretation of the results, and the forecasting abilities 
of the model when applied to the test data subset. 


A. DATA DESCRIPTION 

Table 11 contains the descriptive statistics for the 
entire data set. Included are the means and standard 
deviations for each continuous variable as well as the 
percentage distribution of the categorical variables. A 
little more than nine percent (734 out of 8,042) of the 
individuals contained in the data separated from the Marine 
Corps under VSI and SSB. Forty-one percent are minorities and 
6.7 percent have post high school education. A little more 
than 31.0 percent are in a technical MOS and approximately 
50.0 percent are assigned to units not in the operating 
forces. The average Marine in this sample is twenty-nine 
years old, has a GCT of 105 and is twenty-one months from his 


End-of-Active-Service date. 

Appendix C contains tables representing additional 
characterist, "s of the data set. These tables include cross 
tabulations of race by promotion status and occupational 
specialty. Information in the tables indicates the possible 
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TABLE 11. 

SIMPLE DESCRIPTIVE STATISTICS FOR THE 
EXPLANATORY VARIABLES IM THE 
LOGISTIC MODEL 


Condition?, represented by Variables Percent who 

met Condition 


VSI and SSB Program "Takers" 9.1% 

SSGT (E-6S) 35.9% 

Afro-American Marines 31.4% 

Other Minority Marines 9.6% 

Marines with two years or less of college 5.4% 

Marine with more than two years of college 1.3% 

Marines not married 9.1% 

Marines not married^, who have dependents 5.9% 

Marines married, no children 14.9% 

Marines in Administrative MOSu 22.2% 

Marines in Combat MOSs 14.9% 

Marines in non-technical MOSs 31.8% 

Marine recruiters/drill instructors 21.3% 

Marines working in their MOS 77,3% 

Marines assigned to FMFLANT 18.9% 

Marines assigned to FMFPAC 32.7% 

Marines who score first class on PFT 78.9% 

Marines who are promoted at a slow rate 15.1% 

Marines who are promoted at a fast rate 13.6% 

Marines with less than three years TIG 16.5% 

Number of months until EAS (mean) 21.43 

(standard deviation) (15.05) 

GCT score (mean) 104.95 

(standard deviation) (13.57) 

Marine's Age (mean) 29.67 

(standard deviation)_ (2.10) 


Source: Author 











existence of a racial relationship in selection procedures 
used for promotion and determination of occupational 
specialty. However, conclusions regarding the promotion rate 
and the determination of occupational specialty for different 
races should not be drawn at this time for several reasons. 
First, the data set in this research is not representative of 
the entire Marine Corps due to the limitations placed on it. 
Second, a multivariate analysis is necessary to account 
properly for the relationships of race and VSI and SSB program 
selection. 


B. RESULTS OF THE LOGISTIC MODEL 

Table 12 contains the results of the logistic regression 
model constructed to forecast the. acceptance rates of the VSI 
and SSB programs among Marine, male, E-5s and E-6s. The 

probability that a member separates under either of these 

programs is obtained from the equation: 

P = 1/(1 + e'^) , where 

Z = - 0.98 + (0.06)SSGT - (0.44)BLACK - (0.21)OTHMIN 

+ (0.19)SOMCOLL + (0.26)YR3COLL - (0.74)ADMIN 

- (0.25)FIGHTER - (0.43)NONTECH - (0.46)BONUSMOS 
+ (O.Ol)INTEL + (0.04)INMOS - (0.14)FMPLANT 

- (0.27) FMFPAC - (0.21)PFT_1ST - (0.009)SLOWTRK 

- (0.33)FASTTRK - (0.02)MOSEAS - (0.28)JUNIOR 
+ (0.28)SINGLE + (0.35)SINWDEP - (0.07)MARRIED 

- (0.04)AAGE 

Note, the coefficients have been rounded off to two 
significant digits for equation presentation purposes. 


Statistics to measure the overall fit of the equation to 
the data include the -2 LOG Likelihood statistic. This 
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TABLE 12. 

LOGISTIC REGRESSION MODEL RESULTS IN 
FORECASTING THE TAKE-RATES OF THE 




vs I AND SSB 

PROGRAMS 



Variable 

Name 

Sign 

Hypo¬ 

thesis 

Parameter 

Estimate 

Standard 

Error 

Wald 

Chi- 

Square 

Pr > 
Chi- 
Square 

INTERCEPT 

NA 

-0.977 

0.867 

1.27 

0.260 

SSGT 

- 

0.065 

0.119 

0.29 

0.588 

BLACK*** 

- 

-0.437 

0.112 

15.32 

0.001 

OTHMIN 

- 

-0.207 

0.156 

1.77 

0.184 

SOMCOLL 


0.194 

0.181 

1.15 

0.283 

YR3COLL 

+ 

0.265 

0.366 

0.52 

0.470 

SINGLE'* 

+ 

0.283 

0.135 

4.38 

0.036 

SINWDEP" 

+ 

0.353 

0.163 

4.66 

0.031 

MARRIED 

+ 

-0.066 

0.123 

0.29 

0.592 

ADMIN*** 

- 

-0.741 

0.128 

33.36 

0.001 

FIGHTER* 

- 

-0.249 

0.131 

3.61 

0.057 

NONTECH*** 

.. 

-0.431 

0.106 

16.60 

0.001 

BONUSMOS*** 

- 

-0.462 

0.127 

13.23 

0.003 

INTEL*** 

+ 

0.010 

0.004 

8.67 

0.003 

INMOS 

- 

0.038 

0.115 

0,11 

0.736 

FMFLANT 

+ 

-0.135 

0.113 

1.44 

0.250 

FMFPAC*** 

+ 

-0.272 

0.101 

7.31 

0.007 

PFT_1ST** 

- 

-0.212 

0.097 

4.80 

0.029 

SLOWTRK 

+ 

-0.009 

0. 136 

0.01 

0.945 

FASTTRK** 

- 

-0.334 

0.134 

6.18 

0.013 

MOSEAS*** 

- 

-0.021 

0.003 

39.68 

0.001 

JUNIOR*’ 

- 

-0.288 

0.144 

4.02 

0.045 

AAGE* 

+ 

-0.042 

0.026 

2.63 

0.100 


* Significant at the 0.1 level. 
** Significant at the 0.05 level. 
*** Significant at the 0.01 level. 
Source: Author 
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statistic has a chi-square distribution with the null 
hypothesis that the coefficient for each explanatory variable 
of the equation is zero. For this model, the -2 LOG 
Likelihood statistic equals 209.030 with twenty-two degrees of 
freedom. The probability that the null hypothesis is true for 
this equation is less than 0.0001. Therefore, one may be at 
least 99.99 percent confident that at least one of the 
parameters of this logistic equation is not equal to zero. 

In Chapter IV, hypotheses for the sign of each variable 
are provided along with a potential explanation. Table 12 
also contains the hypothesized signs. In all but seven of the 
cases, the signs are as expected. In five of these seven 
cases, the parameter estimate is not significantly different 
from zero at the 0.10 confidence level. Therefore, one can 
conclude the parameter estimate has no effect, positive or 
negative, on the probability that the service member separates 
under the VSI and SSB programs. 

The other two variables, AAGE and FMFPAC, are significant 
and have opposite signs from those hypothesized. As a Marine 
becomes older, he may be more concerned with the benefits of 
retirement. Noticing that the military's retirement plan is 
one of the more enticing, older Marines may decide to remain 
in the Corps to receive the benefits associated with a 
military career. This may explain why the sign for the 
variable AAGE is negative. Evidently, a Marine's age as a 
proxy for the value placed upon the military's retirement plan 


86 










outweighs the effect argued in Chapter IV that age is a good 
proxy for work experience. 

The variable FMFPAC represents Marines assigned to units 
under the operational command of the Commander, Fleet Marine 
Forces Pacific and may capture other information causing the 
unexpected sign. For instance, most units assigned to the 
FMFPAC Commander are located in Hawaii and California, 
locations with a high cost of living. In these areas, service 
members may receive higher indirect pecuniary benefits from 
military "perks" such as base housing, military exchange and 
commissary facilities, and morale, welfare, and recreation 
facilities. There is potential for Marines on the West Coast 
to place more value on these "perks" than Marines who are 
located on the East Coast and elsewhere. A Marine may feel 
the military allows him to maintain a better lifestyle than he 
could maintain in the same area if he worked in the civilian 
sector. This would tend to lower the probability that he 
would separate from the service under the VSI and SSB programs 
and explains the unexpected sign. 

C. AW INTERPRETATION OF THE MODEL PARAMETERS 

Since the functional form of the model yields the log of 
the odds of a Marine separating under the VSI and SSB 
programs, it is not useful to directly interpret the variable 
parameters as it is with a linear probability model. One 
method to evaluate the partial effects of the parameters is to 
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develop a "referem e" case and then change the value of a 
single explanatory variable while holding all other variables 
constant. The resulting delta in the probability is the 
effect of a change in the explanatory variable on the 
probability that the observation is a program taker. 

Here, the reference case has all the dummy variables set 
to zero, with one exception, and the continuous variables taka 
on their mean values. The exception is the variable NOKTECH 
which is set equal to one. Therefore, this reference case is 
a male Marine Sergeant (E-5) who is white, has a high school 
education, possesses a non-technical military occupational 
specialty (see Appendix B for a breakdown of the different MOS 
groupings), is not a drill instructor or recruiter, has a GCT 
score of 105, is not working in his MOS, is not assigned to a 
unit in the Fleet Marine Force, does not score a first class 
on latest physical fitness test, is promoted at the average of 
his peers, is 21 months from his End-of-Active-Service, has 
more than two years time-in-grade, is married and has 
children, and is 3 0 years old. A person meeting this 
description stands a probability of 0.043 of separating under 
the VSI and SSB programs. 

The remainder of this section outlines the effects of 
changes in each explanatory variable for the reference or base 
case Marine. These changes would have different numerical 
values for a different reference Marine, In a situation where 
the variable has no significant effect, there is no 
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significant difference from the reference case. Table 13 
summarizes the partial effects of changes in the statistically 
significant variables. 

1. Military Compensation 

a. Rank 

SS6T, the variable representing the rank of the 
individual, has an unexpected sign but it is not significantly 
different from zero at the O.lO level. Thus, at the 0.10 
level of significance, there is no difference in being an E--6 
versus an E-5 on the probability of separating from the Marine 
Corps under the VSI and SSB programs. 

b. Marital Status 

Two of the three variables representing marital 
status, SIKGLE and SINWDEP have the expected, negative 
hypothesized sign. The variable MARRIED has an unexpected 
sign but it is not significantly different from zero at the 
0.10 significance level. Both SINGLE and SINWDEP are 
significant to the 0.05 level. If a person is single, he 
stands a 1.3 percent higher probability of separating from the 
Marine Corps under the VSI and SSB programs than a person who 
is married and has children. For a Marine who is single and 
has dependents, this change increases to 1.7 percent. There 
is no significant difference for taking the VSI and SSB 
programs between a married Marine who has children and a 
married Marine who does not have any children. 
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TABLE 13 


CHANGES IN PROBABILITIES WHEN ALTERING THE VALUES 
OF THE EXPLANATORY VARIABLES 


Variable Name 

Sign 

Hypo¬ 

thesis 

Probability of 
Separation 
Under Programs 

Change from 
Reference 
Case 

Reference 

Case 

NA 

0.0430 

NA 

BLACK 

- 

0.0282 

-0.0148 

SINGLE 

+ 

0.0562 

0.0132 

SINWDEP 

+ 

0.0601 

0.0171 

ADMIN 

- 

0.0210 

-0.0220 

FIGHTER 

- 

0.0338 

-0.0091 

CSSTECH 

- 

0.0646 

0.0216 

BONUSMOS 

- 

0.0275 

-0.0155 

INTEL 

(Increment = 
ten points) 

+ 

0.1280 

0.0850 

FMFPAC 


0.0331 

-0.0099 

PFT_1ST 

- 

0.0351 

-0.0079 

FASTTRK 

- 

0.0312 

-0.0118 

MOSEAS 
(Increment = 
one month) 


0.0278 

-0.0152 

JUNIOR 

- 

0.0326 

-0.0104 

AAGE 

(Increment = 
one year) 

+ 

0.0120 

-0.0310 


Footnote: Those variables not listed are not statistically 
significantly different from zero at the 0.10 level and, 
therefore, have no significant impact on changes in the 
reference case. 

Source: Author 
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2. clvili&n compensation 

a. Race 

The two variables representing ethnic origin, BLACK 
and OTHMIN possess the expected signs but only BLACK is 
significant at the 0.10 level. Marines of Afro-American 
decent are 1.5 percent less likely to separate under the VSI 
and SSB programs than their Caucasian counterparts. 

b. Education Level 

Both variables symbolizing education level have the 
ex^ acted positive signs. However, neither variable is 
significantly different from zero at the 0.10 level. 

c. Military Occupational Specialty 

The three variables depicting occupational 
specialty, ADMIN, FIGHTER, and NONTECH have the expected 
negative signs and are significant at the 0.05 level. Marines 
who possess an administrative MOS have an 2.2 percent lower 
probability of leaving the service than those in a non¬ 
technical MOS. For Marines in a combat or technical MOS, the 
change is a 0.9 percent decrease and a 2.2 percent increase, 
respectively. 

Individuals possessing the additional MOSs of 
Recruiter or Drill Instructor face a lower probability of 
leaving the service under the VSI and SSB programs. This is 
illustrated by the negative sign on the variable BONUSMOS. 
This variable is significant to the 0.01 level. Marines with 
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these MOSS stand a 1.6 percent lower probability of leaving 
the Corps than those who do not fill these billets. 

d. Intelligence Level 

The variable INTEL has the expected sign and is 
significant at the 0.05 level. Marines with higher GCT scores 
tend to have a higher probability of leaving the Marine Corps 
than those with lower scores. For a ten point increase in 
score, the probability increases by 8.5 percent. 

e. Age 

As a Marine's age increases, he tends not to 
participate in the VSI and SSB programs. The variable, AAGE, 
is significant at the 0.1 level. For a year older than 
thirty, a Marine stands a 3.1 percent lower probability of 
leaving the Marine Corps than a person who is thirty years old 
and in the reference case. 

3. Mon-Pecuniary Aspects Of Military Life 
a. Job Satisfaction 

Six variables captured different aspects of job 
satisfaction. First, the variable INMOS indicates whether a 
Marine works in his primary MOS. The sign of this variable is 
opposite from the hypothesized sign but the coefficient is not 
significantly different from zero. An it.dividual working in 
his primary MOS does not stand a significantly different 
probability of leaving under the programs than a person not 
working in his MOS. 
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The two variables, SLOWTRK and FASTTRK capture 
promotion rates. FASTTRK has the expected sign but 8LOWTRK 
does not. SLOWTRK's opposite sign causes little concern 
because the coefficient is not significantly different from 
zero. A Marine on the "fast track" stands a 1.2 percent lower 
probability of separating than a Marine who is promoted at an 
average rate. 

The variable, MOSEAS, represents the amount of time 
until the Marine reaches his end of contract. The negative 
sign for the coefficient is the same as for the original 
hypothesis. The variable is significant at the .01 level. 
For a month further from his EAS than, a Marine is 1.5 percent 
less likely to leave under the V.SI and SSB programs. 

The variable FPT_1ST has the expected negative sign 
and is significant at the 0.05 level. A Marine who scores a 
first class on his physical fitness test stands a 0.8 percent 
lower probability of leaving the Marine Corps under the VSI 
and SSB programs than a Marine who does not score as well on 
the physical fitness test. 

The last variable representing job s; tisfaction 
captures the amount of time since a Marine's last promotion. 
JUNIOR has the expected sign and is significant at the 0.05 
level. A Marine with less than tv/o years time-in-grade tends 
to have a 1.0 percent lower likelihood of separating under 
either of the programs than a reference case Marine with more 
than two years time-in-grade. 


93 





Jb. Job Type and Location 

The two variables, PMFPAC and FMFLANT, represent 
job type and location. Each variable has a sign different 
from the hypothesis. One variable, FMFLANT, is not 
significantly different from zero but the other, FMFPAC, is 
significant at the 0.01 level. A Marine who is assigned to 
the operating forces of the Pacific stands a 1.0 percent lower 
probability of separating under the programs while a Marine in 
the Atlantic Forces does not act significantly differently 
from a Marine not in the operating forces. 

D. PREDICTIVE CAPABILITIES OP THE CLASSICAL REGRESSION MODEL 

To determine the predictive capability of the model, it is 
applied to a test data subset withheld from the data used to 
estimate the model. For each observation, the model estimates 
the ratio of the log of the odds of participating in the VSI 
and SSB programs. After converting this value to the 
probability of separating from the Marine Corps under either 
program, the usefulness of the model may be determined. 

The estimated probabilities range from 0.0081 to 0.3388. 
To determine whether each test observation is a taker or a 
non-taker, a cut-off point of 0.0913 is used. This number is 
chosen because it is the percentage of program takers in the 
data set. If the model returns a probability higher than 
0.0912, the person is labeled as a program taker. If the 
probability is lower, then the person is a non-taker. Using 
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this criterion, the model correctly predicted 60.9 percent, 58 
takers and 441 non-takers, of the 819 observations. Table 14 
provides a breakdown of the outcomes when the mouel is applied 
to the test set. As currently calibrated, the model tends to 
over-predict the likelihood of separating under the VSI and 
SSB programs. 


TABLE 14. 

PREDICTIVE CAPABILITIES OF THE LOGISTIC MODEL 
OK THE TEST DATA SET 



QUANTITY 

PERCENTAGE 

CORRECT TAKERS 

58 

7.1 

CORRECT NON-TAKERS 

441 

53.8 

INCORRECT TAKERS 

298 

36.4 

INCORRECT NON-TAKERS 

22 

2.7 

TOTAL 

819 

100.0 
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VI. NEURAL NETWORK MODEL RESULTS 


A. INTRODUCTION 

This chapter provides the results of the ANN models. It 
begins with a brief discussion of the decision process 
involved in constructing the networks. Descriptive statistics 
for the additional variables used in the ANNs follow this 
discussion. The chapter concludes with an evaluation of the 
predictive capabilities of both models. 

Prior to examining the design of each model is it useful 
to review the neural network learning process. The goal of 
the process is for the netv;ork to learn patterns in the 
training data set and then be able to generalize these 
patterns to data sets not previously seen, A network learns 
the training data by estimating outcomes and matching these 
outcomes to the actual outcomes of the training data set. In 
other words, during learning, the network attempts to 
correctly predict the outputs of the training data set. 
Differences in the actual and estimated outputs cause the 
network to continually adjust neuron connection weights until 
the error is minimized. It is important to note that it is 
possible for a network to over-learn the training data. Over¬ 
learning causes the network to lose the ability to accurately 
forecast outcomes for data never seen before. 
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B. ARTIFICIAL NEURAL NETWORK MODEL DESIGN 


This research employs two neural network models. The 
major difference between the two centers on the network 
topology. Each network has thirty-seven input neurons and one 
output neuron. One network contains one hidden layer with 30 
hidden neurons. This network is referred to as Network #1. 
The other network contains two hidden layers, the first having 
twenty hidden neurons and the second having ten hidden 
neurons. This is referred to as Network #2. This section 
describes the design of these two networks. Table 15 provides 
an overview of the important design factors of the two neural 
networks developed during this thesis. 

1. Network Architecture 

Network architecture refers to the learning algorithm 
applied to the network. Back-propagation is chosen as the 


TABLE 15. 

DESIGN FACTORS OF THE NEURAL NETWORKS 


Design Factor 

Network #1 

Network #2 

Network Architecture 

Back-Propagation 

Back-Propagation 

Hidden Layers 

One 

Two 

Hidden Neurons 

30 

30 

Pruned Connections 

389 

448 

Transfer Function 

Sigmoid 

Sigmoid 

Learning Rule 

Normalized 

Normalized 

(Delta Rule Type) 

Cumulative 

Cumulative 

Momentum 

0 

0 


Source: Author 
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architecture for both neural networks. Current literature 
suggests this architecture provides the best results when 
atteiapting to forecast the outcomes of an event. Whil3 this 
is the only architecture investigated in this thesis, further 
research could be directed towards the usefulness of other 
architectures. Klimasauskas and Rathburn (1993) make a case 
for the use of the Extended Delta-Bar-Delta architecture. 

2 . Network Topology 

Network topology encompasses two factors, the number 
of layers in the network and the number of neurons in each 
layer. As discussed earlier. Network #1 and Network #2 differ 
in network topology. Current literature suggests that most 
forecasting problems may be solved with a network design of 
one hidden layer.[Ref. 12] Lapedes [Ref. 13] furnishes a 
calculus-based argument that no more than two hidden layers 
are required to train any neural network employing a back- 
propagation learning algorithm. This thesis examines the 
impacts of changing the number of layers from one to two. 

In determining the number of hidden neurons within 
each layer, two general rules of thumb are utilized. First, 
the total number of connections are restricted to be less than 
the number of observations in the training set. If the total 
number of connections exceeds the number of observations, the 
network tends to lose its ability to generalize to data sets 
never seen before. 
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In this research, the training data set has 1,400 
observations. Network #1 has 1,171 connections. Network #2 
has 981 connections. Table 16 contains a summary of the 
number of connections in each network. 

The bias neuron is connected to each neuron in the 
hidden and output layers. Its primary function is to 
counteract random noise that appears in the network. The bias 
clement is synonymous with the intercept coefficient in the 
classical regression model. 


Table 16. 

NUMBER OF CONNECTIONS WITHIN EACH NETWORK 


Location of 

Connections 

Number of 

Connections 

From Neurons In: 

To Neurons In: 

Network #l 

Network #2 

Input Layer 

Hidden Layer #1 

1,110 

74 0 

Hidden Layer #1 

Hidden Layer #2 

0 

200 

Hidden Layer #1 

Output Layer 

30 

0 

Hidden Layer #2 

Output Layer 

0 

10 

Bias Element 

Hidden Layers 

30 

30 

Bias Element 

Output Layer 

1 

1 

Total Connections: 

1,171 

981 

Source: Author 




The other 

rule of thumb 

concerns the 

placement of 


neurons when using more than one hidden layer. The ratio of 
neurons between connecting hidden layers is kept at a ratio of 
two to one. Therefore, in Network #2, there are twenty hidden 
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neurons in the first hidden layer and ten in the second hidden 
layer. Network #1 has all the neurons in the first, and only, 
hidden layer. When there are many neurons within a hidden 
layer, it becomes harder for the network to generalize to data 
sets it has never suen before. To counter this problem, the 
technique of pruning is used. Pruning is a method of 
eliminating connections between neurons which have negligible 
connection weights. 

NeuralWorks Professional II/Plus (hereafter referred 
to as NeuralWorks) contains a pruning technique which allows 
the user to determine the minimal weight required for a 
connection to remain active. This technique eliminates 
connections which have a weight with a magnitude that is less 
than some specified percentage of the average of all of the 
absolute weights. NeuralWorks allows the network designer to 
specify the percentage value. This research uses a value of 
forty percent for two reasons. First, it is large enough to 
prune many negligible weights. Second, it is not so large 
that it causes excessive pruning. The forty percent value 
resulted in the pruning of 389 connections in Network #1 and 
448 in Network #2. 

The ba -propagation algorithm within NeuralWorks 
supports three transfer functions. They are the summation, 
sigmoid, and hyperbolic tangent transfer functions. This 
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research experiments with both the sigmoid and hyperbolic 
tangent transfer functions. The best results are achieved 
with the sigmoid transfer function. Using the sigmoid 
transfer function results in a normal distribution of the 
weights and a small number of weights possessing large 
absolute values; two criteria recommended by NeuralWorks to 
determine if the network is an optimal solution. 

The hyperbolic tangent transfer function results in a 
"split" distribution with many weights having large absolute 
values. The summation transfer function is not investigated. 

4. Learning Rules And Rates 

NeuralWorks supports the Delta Rule, Cumulative Delta 
Rule, and the Normalized Cumulative Delta Rule when using the 
back-propagation algorithm. This research chooses the 
Normalized Cumulative Delta Rule based upon the 
recommendations of the software manufacturers. The advantage 
of this rule over the others is that it accelerates the 
training time required by the network. 

The epoch size' is set as a balance between two 
recommendations from NeuralWorks. First, NeuralWorks 
recommends the epoch size should be set to a multiple of the 
training data set size. This ensures that connection weight 
changes are made upon complete passes through the data. The 

‘ Epoch size is the number of observations presented to the 
network before the connection weights are adjusted to account for 
error difference between the estimated and actual outcomes. 
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training data set contains 1,400 observations. NeuralWorks 
also recommends the epoch size should be restricted to the 
range sixty-five to one hundred. This prevents the network 
from learning for long periods of time without updating the 
connection weights. Thus, the epoch size is set at seventy. 

As discussed in Chapter III, the learning rate is a 
value determined by the neural network designer which is used 
in calculating the change in the connection weights. This 
change is determined by multiplying the error of a connection 
times the learning rate times the output from the connection. 
Error is measured as the difference between the estimated 
value (a value between zero and one) and the actual value (a 
value of zero or one), within NeuralWorks, the learning rate 
is adjustable during the learning of the network. Table 17 
contains the values of the learning rates and the intervals 
during which they became effective. 

Momentum is a factor added to the weight changes which 
allows the network to avoid becoming trapped in local error 
minimums. The momentum term for this thesis is set to zero. 
NeuralWorks possess the capability to vary the learning rates 
and momentum values between layers. For simplicity, the 
learning rates are held constant across each layer. 

5. Stopping Criteria 

Once a network is built and begins to learn, stopping 
criteria are needed to determine the point to stop learning. 
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TABLE 17 


LEARKIN6 RATE SCHEDULE FOR THE ARTIFICIAL 
NEURAL NETWORKS 


Number of passes through data set 

Learning Rate Value 

400-800 

2.0 

801-1,200 

1.0 

1,201-1,600 

0.5 

1,601-2,000 

0.25 

2,001-2,500 

0.01 


Source: Author. 


This point may be the number of passes through the data set, 
the elapsed computer time, or functions of comparisons of the 
actual versus estimated outcomes such as root mean square 
(RMS) error or the correlation value between the predicted and 
actual outcome. Researchers may choose any combination which 
is appropriate for their research. It is important that 
stopping criteria be developed prior to model construction. 
This allows researchers to establish ground rules for the 
research before it begins instead of developing new rules as 
the research progresses. 

In this research, the criterion used is the number of 
passes through the data set. The network is presented with 
the training data set 2,500 times. This number is chosen 
because the network views the training data set a sufficient 
number of times for the network to learn patterns which exists 
in the training data set. 













When the netv/ork passes through the training data set 
2,500 times, the RMS error ranges from 0.20 to 0.35 and 
averages roughly 0.275. The correlation between the predicted 
and actual outcomes is 0.8 and above. The probability 
distribution for the training data set is 0.5. One half of 
the observations are takers and the other half are non-takers. 
If a random assessment model is used to predict the outcomes 
of the training data set, the correlation between actual and 
predicted would be 0.5. Thus, the neural networks provide a 
substantial improvement over the predictive capabilities of a 
random assessment model.^ 

When building ANN models, researchers must decide what 
criteria to use in determining the learning stop point. The 
longer a network runs, the greater the possibility it may not 
be able to generali 2 e to the test data. When this occurs, the 
ANN model has essentially memorized the training data set. 

This research experiments with both a smaller and 
larger number of passes through the data set. In both cases, 
smaller and larger numbers of presentations of the data set to 
the network lead to higher RMS error and lower corre '.ation 
between the predicted and the actual outcomes. 


^ The processing time required to complete 2,500 passes 
through the data with a personal computer that possesses an INTEL 
486 chip working at 25 megahertz is approximately eighteen hours. 
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C. DESCRIPTIVE STATISTICS OF AUGMENTED VARIABLES 

Table 18 contains the descriptive statistics for the seven 
variables added to the ANN models which are not contained in 
the classical regression model. The mean and standard 
deviation of each variable is provided. Marines in the data 
set have an average of 11 years of service, have been at their 
present jobs on the average about a year and a half, and 
average about two months of deployed training time. On the 
average, it has been about three and a half years since they 
were assigned to an overseas duty station without their 
families. The average score on the ASVAB test for these 
Marines is in the forty-fifth percentile. 

D. PREDICTIVE CAPABILITIES OF THE ANN MODELS ON THE TEST DATA 

To determine the predictive capability of the models, they 

are applied to a test data subset v/ithheld from the data used 
to estimate the model. For each observation, the model 
estimates a continuous outcome value between zero and one. If 
this value is over 0.5 then the observation is labeled a taker 
of the VSI and SSB programs. If the value is less than 0.5, 
the observation is labeled a non-taker. This value is picked 
as the cutoff point because it represent the probability 
distribution of program takers and non-takei's within the 
training data set. 

Using this criterion. Network #1 correctly predicted 58.1 
percent, 49 takers and 427 non-takers, of the 819 decisions. 
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TABLE 18 


SIMPLE DESCRIPTIVE STATISTICS FOR THE 
EXPLANATORY VARIABLES ADDED TO THE 
ANN MODELS 


Conditions represented by 
Variables 

Mean 

Standard 

Deviation 

Years of Service 

11. 24 

1.80 

Elapsed Time At Present Job 
(In Years) 

1-47 

0.83 

Accumulated Deployed Time 
(In months) 

1.99 

3.15 

Years since Last Deployed 
Overseas Without Family 

3.66 

4.09 

Percentile Ranking on ASVAB 

45.18 

25.92 

Raw Score on Arithmetic 

Reasoning (ASVAB Test) 

17.61 

6.47 

Raw Score on Paragraph 
Comprehension (ASVAB Test) 

7.49 

5.80 

Raw Score on Word Knowledge 
(ASVAB Test) 

24.97 

6.90 

Raw Score on Mathematical 
Knov/ledge (ASVAB Test) 

13.09 

5.11 

Source: Author 




T Die 19 provides a breakdown of the outcomes when the 
Network #1 is applied to the test set. As currently 
calibrated, the network tends to over-prediot the likelihood 
of separating under the VSI and SSB programs. Table 20 
displays the predictive abilities of Network #2. This network 
correctly predicted 66.8 percent, 47 takers and 500 non¬ 
takers, of the 819 observations. 
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Although it appears the prediction rates are better for 
Network #2, there is no way to determine if this difference is 
statistically different from zero. One of the drawbacks to 
using neural networks is the lack of statistical measures of 
model fit and Wald Chi-square values for parameter estimates 
in determining significance levels when assessing individual 
models and making model comparisons. 


TABLE 19. 


PREDICTIVE CAPABILITIES OF NEURAL NETWORK #1 
IN PREDICTING TAKERS OF THE VSI AND SSB PROGRANS 
ON THE TEST DATA SET 



QUANTITY 

PERCENTAGE 

CORRECT TAKERS 

49 

6.0 

CORRECT NON-TAKERS 

427 

52.1 

INCORRECT TAKERS 

312 

38.1 

INCORRECT NON-TAKERS 

31 

3.8 

TOTAL PEST DATA 

819 

100.0 


Source: Author 
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TABLE 20. 


PREDICTIVE CAPABILITIES OF NEURAL NETWORK #2 
IN PREDICTING TAKERS OF THE VSI AND SSB PROGRAMS 
ON THE TEST DATA SET 



QUANTITY 

PERCENTAGE 

CORRECT TAKERS 

47 

5.7 

CORRECT NON-TAKERS 

500 

61.1 

INCORRECT TAKERS 

239 

29.2 

INCORRECT NON-TAKERS 

33 

4.0 

TOTAL TEST DATA 

819 

100.0 


Source; Author 














VII. MODEL COMPARISONS 


A. INTRODUCTION 

This chapter compares the effectiveness of the three 
models estimated for this research. Two model evaluation 
criteria are examined. First, the predictive capabilities of 
each model are assessed using two previously defined metrics. 
Second, the partial effects of each explanatory variable 
within the three models are examined. 

B. Model Forecasting Abilities 

Table 21 contains the metrics employed to compare the 
models. The metrics are generated by applying the models to 
the test data set (n = 819) . There are two comparison 
metrics. The first, M,, was developed by Sharda and Wilson 
(1993) and is reviewed in Chapter II. M, has two components, 
Mj, and M,„. The first, M„, represents the probability that the 
model correctly predicts a program taker. The second, Mj^, 
represents the probability that the model correctly predicts 
a program non-taker. The other metric, a simulated R*, was 
developed by Wiggins et al. and is also described in Chapter 
II. The goal is to maximize each of these values. 

Applying the metrics, the logistic regression model is a 
better forecaster chan the neural network model with one 
hidden layer (Network #1). The probability it correctly 
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TABLII 21 


COMPARISON OF THE FORECASTING ABILITIES OF 
THE NEURAL NETWORK MODELS AND 
THE LOGISTIC REGRESSION MODEL 
ON THE TEST DATA SET (N = 819) 


Model Types 


Logistic Neural Neural 

Regression Network #1 Network #2 


Number Correctly 
Predicted 

499 

(60.9) 

478 

(58.1) 

547 

(66.8) 

Correctly Predicted 
Program Takers 

58 

49 

47 

Correctly Predicted 
Program Non-Takers 

441 

427 

500 

Incorrectly Predicted 
Program Takers 

298 

312 

239 

Incorrectly Predicted 
Program Non-Takers 

22 

31 

33 

Mil (Prob of correct 
taker prediction) 

0.16 

0.14 

0.16 

Mi„ (Prob of correct 
non-taker prediction) 

0.95 

0.93 

0.94 

simulated R^ 

0.0981 

0.0721 

0.0979 


Source':' Author'' 

Footnote: The number in parenthesis is the percentage value. 


predicts a program taker, M,i, equals 0.16, and is higher than 
Network #l's probability of correctly predicting a program 
taker (0.14). The logistic model is also better at correctly 
predicting program non-takers; 0.95 versus 0.93. However, 
the neural network with two hidden layers (Network #2) is as 
good as the logistic model. Both models have the same 

















probability of predicting a taker and the logistic model is 
slightly better at predicting non-takers. 

The simulated is described in detail in Chapter II. If 
a model produces outputs equal to the actual outputs in every 
case, then the simulated R* is equal to one. The simulated R* 
may generate a negative number if the model produces a 
predicted take-rate lower than the take-rate obtained by 
simply using just the mean take-rate of the sample.[Ref. 
Il;p.l5] The logistic model's simulated (0.0931) is small. 
Logistic regressions normally provide smaller R^ values than 
other regression models. [Ref. 7:p. 55] Network #1 produces 
a lower simulated R^ (0.0721) than the logistic model. 
However, Network #2 results in a simulated R^ (0.0979) which 
is essentially the same as that which is provided by the 
logistic model. 

C. Partial Effects of the Explanatory Variables 

In this research, neural networks provide forecasting 
results which are almost as good as those obtained from 
logistic regression models. However, military manpower and 
personnel analysts also concern themselves with policy 
implications suggested by models. That is, they are 
interested in the partial effects of the variables in the 
model as well as the model's prediction capabilities. 

Interpreting partial effects for logistic models is fairly 
straightforward. However, with neural networks the task is 







Kiore difficult for two reasons. First, neural networks lack 
the statistical tests required to determine the significance 
levels associated with both the explanatory variables of the 
neural network and the overall fit of the network to the data. 
Therefore, researchers are unable to determine if the 
connection weights associated with an explanatory variable are 
significantly different from zero. Without these statistical 
tests, interpretation of the significance of the explanatory 
variables is difficult, at best. 

The other drawback to interpreting the partial effects of 
the explanatory variables for neural networks evolves from the 
lack of a probability level associated with the network 
output. In this research, the neural networks are trained on 
data which possess output values of zero or one. A neural 
network producer probabilities associated with a specific set 
of input variables only when the network is trained on a data 
set which contains output values that are probabilities 
associated with a specific set of input variables. 

Obtaining a data set with this characteristic is extremely 
difficult. Most manpower and personnel data sets contain 
information on the decision a person has made. Does the 
person reenlist or separate? Does he accept VSI or SSB? It 
does not contain the probability that the individual chooses 
one path or another. 
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Therefore, neural networks do not lend themselves to 
interpreting the partial effects of explanatory variables 
because they lack statistical tests to determine variable 
significance and the output of a neural netv/ork is not a 
probability level. 

These two drawbacks hinder the ability of researchers 
employing neural network modeling techniques in analyzing the 
impacts of policy changes. In an attempt to overcome these 
deficiencies, this research compares the impact on the 
classification of an observation when changing one explanatory 
variable for a reference case while holding the other 
conditions constant. 

As in Chapter V, the reference case is defined as a male 
Marine sergeant (E-5) who is white, has a high school 
education, possesses a non-technicai military occupational 
specialty, is not a drill instructor or recruiter, has a GCT 
of 105, is not working in his primary MOS, is not assigned to 
a unit in the Fleet Marine Force, does not score first class 
on his most recent physical fitness test, is promoted at the 
average of his peers, is twenty-one months from his end-of- 
active-service date, has more than two years time-in-grade, is 
married and has children, and is thirty years old. 

In the logistic model, if the model output is greater than 
0.0913 (the proportion of program takers in the test data 
set), the observation is categorized as a program taker. In 
the neural network models, if the output is greater than 0.5 
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(the midpoint of all possible outcomes), the observation is 
categorized as a program taker. 

Table 22 contains classifications of the reference case as 
a taker or non-taker of the VSI and SSB programs and the 
resulting classification when each explanatory variable is 
changed while holding the other variables constant. The 
results are disturbing for two reasons. First, there is a 
lack of consistency between the classification of the 
reference case. If neural networks provided the same results 
as classical regression models, the base case should receive 
the same classification regardless of the model which is used. 
Here, Neural Network #1 (one hidden layer) classifies the 
reference case as a program taker while Neural Network #2 (two 
hidden layers) classifies the reference case the same as the 
logistic model does, as a program non-taker. 

The other undesired result is a lack of consistency in the 
impact on the observation's classification of changing the 
explanatory variables one at a time and holding the ethers 
constant.^ When changing the twenty-two explanatory variables 
in the logistic model one at a time, there is no switch in the 
classification of the reference case observation from a 


^ If more than one change to the reference case is desired 
simultaneously, a new reference case must be developed and 
incremental changes to it are required. Interpreting partial 
effects of explanatory variables requires, changing one explanatory 
variable at a time while holding all others constant. The method 
applied in this research to examine partial effects of explanatory 
variables does not lend itself to an analysis of multiple changes 
in the explanatory variables. 
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TABLE 22 


CLASSIFICATION OF OUTCOMES FOR A REFERENCE CASE AND 
CHANGES IN THE EXPLANATORY VARIABLES FOR 
EACH OF THE THREE MODELS 


Changed Condition 


Reference Case 
Rank of E-6 
Afro-American Decent 
Other Minority 

2 Yrs and less College 

3 Yrs and More College 
Single W/Dependents 
Married No Dependents 
Single, No Dependents 
In an Admin Job 

In a Combat Job 

In a Technical Job 

Recruiter/Drill 

Instructor 

GCT Score 

Working in Primary MOS 
Assigned to FMFLANT Unit 
Assigrjed to FMFPAC Unit 
Scores l“ Class on PFT 
Months Until EAS 
Less Than Two Years TIG 
Age 

Slow Promotion Track 

Fast Promotion Track 
Source: Author 


Classification 


Logistic Net #1 


Net #2 


Non-Taker 

Non-Taker 

Non-taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 


Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Taker 

Non-Taker 

Taker 

Non-Taker 

Taker 

Taker 

Taker 

Taker 

Non-Taker 


Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Taker 

Non-Taker 

Taker 

Non-Taker 

Non-Taker 

Taker 

Non-Taker 


Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 

Non-Taker 


Non-Taker 
Taker 
Non-Taker 
Non-Taker 
Taker 
Taker 
Non—Taker 
Taker 
Taker 
Non-Taker 


Non-Taker 
Non-Taker 
Non-Taker 
Non-Taker 
Taker 
Non-Taker 
Non—Taker 
Non-Taker 
Taker 
Non-Taker 
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program taker to a non-taker, with one exception. If the 
observation's GCT score is increased by ten points, the 
observation becomes classified as a program taker. All other 
changes to the reference case result in observations which are 
classified as program non-takers. 

Neural Network #1 switches the classification of the 
reference observation from a program taker to a program non¬ 
taker in eleven instances. Neural Network #2 switches the 
observation's classification from a program non-taker to a 
program taker in five instances. Even more confusing, none of 
the switches in classifications occur when changing the same 
explanatory variable. 

In order to use artificial neural networks to obtain 
guidance for policy variables, it is important to have some 
degree of consistency of results among models. This is not 
the case with the networks developed in this research when the 
partial effects of the explanatory variables are explored. 

D. CONCLUSIONS 

To determine the usefulness of a modeling technique to 
military manpower and personnel analysts, two aspects of the 
model roust be examined; its forecasting ability and 3ts 
capacity to indicate the partial effects of the explanatory 
variables. When comparing the logistic and neural network 
models whJeh are developed in this research, two aspects of 
neural networks are obvious. First, the forecasting ability 
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of the network with two hidden layers is equal to that of the 
logistic model. Second, neural networks are not effective 
tools for analyzing the partial effects of explanatory 
variables. The networks lack consistency when compared to 
each other and compared to the results provided by the 
logistic model. since military manpower and personnel 
analysts concern themselves with the impacts of policy, 
extreme caution should be used when solely employing 
artificial neural networks in analyzing the partial effects of 
explanatory variables. 
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VIII. CONCLUSIONS AND RECOMMENDATIONS 


A. INTRODUCTION 

This thesis compares the forecasting abilities of two 
neural network models with the forecasting abilities of a 
logistic regression model. The models attempt to forecast the 
take-rates for the VSI and SSB programs for male, Marine Corps 
E-5s and E-6s in fiscal year 1993. The research builds three 
models, one classical regression model and two neural network 
models. The logistic regression model contains twenty-two 
variables and is based upon the theory of occupational choice. 
The neural networks are constructed using all potential 
explanatory variables available from the data set (37 
variables). Three sub-samples are created to build and test 
each model; a training data set consisting of 7,223 
observations for the logistic regression model, a training 
data set of 1,400 observations for the neural networks, and a 
test data set of 816 observations for all three models. 

The neural network models employ a back-propagation 
architecture with a normalized cumulative delta learning rule, 
a sigmoid transfer function, and are trained for 2,500 passes 
through the data set. The overriding difference between the 
two models is the number of hidden layers. One network has 
one hidden layer while the other has two. Both networks have 
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thirty hidden neurons in the hidden layers. The logistic 
regression model is based on previous research of the VSI and 
SSB programs conducted by Noblit (1993). 

This chapter presents several conclusions from the 
comparisons of the three networks. It also provides policy 
implications and concludes with recommendations for future 
research. 


B. CONCLUSIONS 

Three conclusions may be drawn from this research. First, 
the neural networks designed with two hidden layers possess 
similar forecasting abilities to the logj.stic regression model 
built in this research. Even though the logistic model is a 
slightly better forecaster of program non-takers, artificial 
neural networks provide forecasting results as good as those 
obtained from classical regression models. 

The second conclusion concerns which topology provides 
better results. In this research, a neural network designed 
with two hidden layers (Network #2) supplies slightly better 
forecasting results than a neural network with one hidden 
layer (Network #1). However, Network #2 generated slightly 
more false negatives'* than Netv/ork #1. This is a cause for 
some concern since false negatives carry a relatively high 
cost in the execution of the VSI and SSB programs. In this 


■* A false negative occurs when the model predicts a program 
non-taker and in reality, the observation is a program taker. 
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research, increasing the number of hidden layers increases the 
forecasting capabilities of the neural networks. 

The third conclusion concerns the usefulness of neural 
networks in evaluating policy decisions. Military manpower 
and personnel analysts concern themselves with policy 
implications suggested by models. That is, they are 
interested in the partial effects of the variables in the 
model as well as the model's prediction capabilities. 

Interpreting partial effects for neural networks is a 
difficult task because 1) neural networks lack the statistical 
tests required to determine the significance levels of 
explanatory variables and the overall model fit to the data 
and 2) neural networks lack the ability to produce a 
probability level associated v;ith a set of explanatory 
variables. In addition to these difficulties, there is a lack 
of consistent results when evaluating the partial effects of 
explanatory variables across neural network models constructed 
on identical data sets. These inconsistencies and 
difficulties do not allow researchers to interpret easily the 
partial effects of the explanatory variables contained in 
neural network models. Therefore, neural networks should be 
used in conjunction with modeling techniques that support 
analyzing the partial effects of explanatory variables. 
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C. POLICY IMPLICATIONS 


Models developed in this research have several 
implications about the VSI and SSB programs. First, if a 
Marine E-5 or E-6 has a technical MOS, he stands a higher 
probability of taking the program than a Marine in another 
MOS. This indicates that Marines with specialized training 
are leaving the Marine Corps at a greater rate than their non 
skilled counterparts. If the Marine Corps is attempting to 
reduce the number of its technicians, then this is not 
problem. However, decision makers must realize that the 
training cost associated with these individuals is high. It 
is important for the Marine Corps to receive a return on its 
investment in these individuals. Further evaluation of the 
VSI and SSB programs is required to determine if these 
programs are causing the Marine Corps to receive a rate of 
return below acceptable levels on the technical training of 
Marines in highly specialized MOSs. 

Three variables in the logistic model PFT_1ST, BONUSMOS, 
and FASTTRK, capture characteristics of the type of individual 
the Marine Corps desires to retain. This research reveals 
that male E-Bs and E-6s with these characteristics are not 
separating from the Marine Corps under the VSI and SSB 
programs. Those male E-5s and E-6s who score first class on 
their PFT (represented by PFT_1ST) stand a 2.8 percent lower 
probability of separating from the Marine Corps under the VSI 
and SSB programs than their non-first class PFT counterparts. 
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Drill Instructors and Recruiters (represented by BONDSMOS ) 
stand a 5.5 percent lower probability of separating than their 
counterparts who are not Drill Instructors and Recruiters. 
Those Marine E“5s and E-6S promoted faster than their peers 
(represented by FASTTRK) stand a 4.2 percent lower probability 
of separating than their counterparts who are promoted at an 
average rate. 

Artificial neural networks have potential to provide 
excellent forecasting results. However, before they may be 
used independently of other modeling techniques in the 
research of manpower and personnel related problems, the 
algorithms within the neural network "black box" must be 
advanced to a level which provides more information on the 
importance of individual explanatory variables. This impacts 
on military policy in two ways. First, policy makers must 
ensure sound statistical methods are employed when analyzing 
military manpower and personnel related data. Second, further 
research into neural networks is required to examine the 
appropriate domain of application for this emerging 
technology. 

D. RECOMMENDATIONS FOR FURTHER STUDY 

One area for further research centers on the different 
architectures available within the context of NeuralWorks. 
This thesis examined only the back-propagation learning 
algorithm. NeuralWorks supports fifteen other learning 
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algorithms. The application of these algorithms to manpower 
related data sets is rare. Two algorithms which have produced 
useful results in other types of applications are the Delta- 
Bar-Delta and Extended-Delta-Bar-Delta algorithms. An 
analysis of the impacts of these architectures on the 
forecasting capabilities of a neural network built with a 
manpower related data set would be valuable. 

A second area of research is related to the choice of the 
transfer function. This research experiments with both the 
hyperbolic tangent transfer function and the sigmoid transfer 
function. The transfer function is held constant across all 
layers. NeuralWorks supports the ability for changing the 
transfer function from one layer to the next. Researchers 
could experiment with adjusting the transfer functions between 
layers and evaluate any change in network performance. One 
possibility is to use a threshold function instead of a 
saturation function as applied in this thesis. A threshold 
function forces a neuron to excite and output a value of one 
if it receives inputs above some level. Otherwise, it remains 
dormant and outputs a zero. Further research into the 
implications of employing a threshold function within the 
design of neural networks may provide some useful insights 
into further understanding this developing field. 

A third area of potential research concerns the modeling 
techniques applied to the VSI and SSB take-rate problem. 
Marines who choose not to accept these programs the first time 
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they become eligible are also eligible in the following year. 
Sequential modeling techniques are the best suited to model 
this type of problem. Single period logistic modeling 
techniques provide excellent exploration results but are 
inadequate at controlling many of the features needed in 
situations where individuals are faced with a recurring 
decision. 

This research builds a model for male, Marine Enlisted 
between the ranks of E-5 and E-6. Models constructed for 
other subgroups (i.e., females, Marine Officers, etc.) may 
reveal very different decision patterns. This would assist 
policy analysts in examining the impacts of policy changes for 
other Marines eligible for the VSI and SSB programs. 

This research did not distinguish between those who 
separate under the VSI program and those who separate under 
the SSB program. Each program differs in the way that the 
cash incentive is paid out. Research on the differences 
between program selection and the characteristics of 
individuals choosing the different programs would provide 
policy makers with information to determine which program is 
best suited to meet desired goals. If one program is more 
productive than the other, future offerings could be 
constructed to reduce financial strain on available resources 
and eliminate program redundancies. 
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E. CONCLUDING COMMENTS 

This thesis examined the ability of neural networks to 
assist manpower and personnel policy analysts in the execution 
of their duties. Neural networks provide forecasting results 
which are comparable to those provided by classical regression 
models. However, neural networks should not be used 
independently in the analysis of manpower and personnel 


related problems, 


Neural networks present too many 


inconsistencies and lack the statistical tests required to 
examine the partial effects of the explanatory variables. 
However, those unfamiliar with the theory underlying a 
particular problem can present all data elements to the 
network and it will sort out variables and variable 
combinations which are important in determining the actual 
outcomes. 

Neural networks can add to the policy analyst's tool box. 
Combining neural networks with conventional and traditional 
statistical modeling methods enhances the capabilities of 
policy analysts and therefore, increases the knowledge base of 
decision makers. 







APPENDIX A 


VARIABLES USED IN BYRNE'S AND NOBLIT'S RESEARCH 


VARIABLE NAME VARIABLE DESCRIPTION 


BYRNE NOBLIT 


PROXIES FOR MILITARY COMPENSATION 


E4 

= 1 IF AN E4 
= 0 IF NOT AN E4 

NO 

YES 

E5 

= 1 IF AN E5 
= 0 IF NOT AN E5 

NO 

YES 

E6 (Noblit) 

SSGT (Byrne) 

= 1 IF AN E6 
= 0 IF NOT AN E6 

YES 

NO 

E7 

= 1 IF AN E7 
== 0 IF AN E7 

NO 

YES 

PROPAY 

= 1 IF RECEIVING PRO PAY 
= 0 OTHERWISE 

NO 

YES 

BONUS 

= 1 IF RECEIVING AN SRB 
= 0 OTHERWISE 

NO 

YES 

SINGLE 

= 1 IF SINGLE/NO DEPEND 
= 0 OTHERWISE 

YES 

YES 

MARRIED 

= 1 IF MARRIED/NO CHILD 
= 0 OTHERWISE 

YES 

YES 

S DEP (Noblit) 
SINWDEP (Byrne) 

= 1 IF SINGLE/NO DEPEND 
= 0 OTHERWISE 

YES 

YES 

PROXIES FOR CIVILIAN COMPENSATION 

BLACK 

= 1 IF AFRO-AMERICAN 
= 0 OTHERWISE 

YES 

YES 

OTH MIN(Noblit) 
OTHMIN (Byrne) 

=1 IF NOT WHITE OR BLACK 
= 0 OTHERWISE 

YES 

YES 

ADMINSU(Noblit) 
ADMIN (Byrne) 

= 1 IF IN ADMIN JOB 
= 0 OTHERWISE 

YES 

VTTO 

A- -Cji-/ 

CMBT (Noblit) 
FIGHTER (Byrne) 

= 1 IF IN COMBAT JOB 
= 0 OTHERWISE 

YES 

YES 

CSS NT (Noblit) 
NOT_TECH(Byrne) 

= 1 IF IN NON-TECH JOB 
= 0 OTHERWISE 

YES 

YES 
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VARIABLE NAME VARIABLE DESCRIPTION 


BYRNE NOBLIT 


PROXIES FOR CIVILIAN COMPENSATION 



FEMALE 

= 1 IF FEMALE 
= 0 OTHERWISE 

NO 

YES 

INTEL 

= OBSERVATION'S GCT SCORE 

YES 

NO 

AAGE 

= OBSERVATION'S AGE 

YES 

NO 

BONUSMOS 

= 1 IF RECRUIT^R/DRILL INST 
= 0 OTHERWISE 

YES 

NO 

COLLEGE 

= 1 IF FINISHED ANY COLLEGE 
= 0 OTHERWISE 

NO 

YES 

SOMCOLL 

= 1 IF COMPLETED 1 OR 2 YRS 
= 0 OTHERWISE 

YES 

NO 

YR3COLL 

= 1 IF FINISHED 3 OR MORE 
= 0 OTHERWISE 

YES 

NO 

NON-PECUNIARY 

FACTORS INVOLVED IN STAY OR LEAVE DECISION 

TIG 

TIME IN GRADE (IN YEARS) 

YES 

YES 

PRO_IND 

A CONTINUOUS VARIABLE 
REPRESENTING PROMOTION RATE 
OF OBS COMPARED TO PEERS 

NO 

YES 

SLOWTRK 

= 1 IF PROMOTED SLOWER 
= 0 OTHERWISE 

YES 

NO 

FASTTRK 

= 1 IF PROMOTED FASTER 
= 0 OTHERWISE 

YES 

NO 

INHOS 

= 1 IF IN PRIMARY MOS 
= 0 OTHERWISE 

YES 

YES 

NON_FMF 

= 1 IF NOT IN THE FMF 
= 0 OTHERWISE 

NO 

YES 

FMFLANT 

= 1 IF ASSIGN TO FMFLANT 
= 0 OTHERWISE 

YES 

YES 

FMFPAC 

= 1 IF ASSIGN TO FMFPAC 
= 0 OTHERWISE 

YES 

NO 

YRS_EAS 

NUMBER OF YEARS TO EAS 

NO 

YES 

PFT_1ST 

^ 1 IF 3 ST CLASS ON PFT 
= 0 OTHERWISE 

YES 

NO 

MOS_ EAS 

THE NUMBER OF MONTHS TO EAS 

YES 

NO 













APPENDIX B 

MILITARY OCCUPATIONAL SPECIALTY CATEGORIES 


Category 

Military Occupational Specialty 

(Variable Name) 

Numerical 

Description 

Combat 

03XX 

Infantry 

(FIGHTER) 


08XX 

Field Artillery 


18XX 

Tank and Amphibious Assault 

Administrative 

OIXX 

Personnel and Administration 

(ADMIN) 


20XX 

Supply Admin and Operations 


31XX 

Traffic Management 


34XX 

Auditing, Finance, Accounting 


40XX 

Data Systems 


41XX 

Marine Corps Exchange 


43XX 

Public Affairs 


44XX 

Legal Affairs 


46XX 

Train and Visual Equipment 


55XX/98XX 

Music 


66XX 

Aviation Supply 


84XX 

Rec>-uiting 

Tech Support 

IIXX 

Utilities 

(CSS_TECH) 


13XX 

Engineer, Construction Equip 


21XX 

Ordnance 


28XX 

Data/Communications Maint. 


59XX 

Electronics Maintenance 


60XX/61XX 

Aircraft Maintenance 


63XX/64XX 

Avionics 


Source; Navy Mai-ine Corps Form 1008 -A (April, 1992) 
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Category 
(Variable Name) 


Service Support 
Non-Technioal 
(NON TECH) 


Military Occupational Specialty 


Numerical 


Description 


02XX 


Intelligence 


04XX 

15XX 

23XX 

25XX 

26XX 

33XX 

35XX 

57XX 

58XX 

65XX 

70XX 

72XX 

73XX 


Logistics 

Printing and Reproduction 

Ammunition and Explosive 
Ordnance Disposal 

Operational Communications 

Signals Intelligence/Ground 
Electronic Warfare 

Food Service 

Motor Transport 

Nuclear, Biological, and 
Chemical 

Military Police and 
Corrections 

Aviation Ordnance 

Airfield Services 

Air Control/Air Support/Anti 
Air Warfare 

Air Traffic Controller 
Enlisted Flight Crews 

Weather Service 


i 


68XX 
















APPENDIX C 


ASSORTED TABLES CONTAINING FREQUENCIES FROM DATA SET 


PROMOTION RATE BY RACE/ETHNIC GROUP 


Promotion 

Rate 







Caucasian 

Afro- 

American 

Hispanic 

Asian 

Other 

Slow 

689 

403 

80 

20 

34 

Rate 

(14.52) 

(15.93) 

(14.57) 

(20.48) 

(28.18) 

Average 

3,341 

1,849 

401 

58 

72 

Rate 

(70.43) 

(73.15) 

(72.65) 

(60.24) 

(59.09) 

Fast 

714 

276 

70 

19 

16 

Rate 

(15.05) 

(10.92) 

(12.77) 

(19.28) 

(12.73) 

Totals 

4,744 

(100.0) 

2,528 

(100.0) 

551 

(100.0) 

97 

(100.0) 

122 

(100.0) 















I4ILITARY 


OCCUPATIONAL SPECIALTY 
BY RACE/ETHNIC GROUP 


I 


CATEGORIES 


Military 

Occupational 

Special 



Caucasian 

Afro- 

American 

Hispanic 

Asian 

Other 

Adrain 

888 

(18.72) 

709 

(28.04) 

133 

(24.15) 

27 

(27.71) 

35 

(29.09) 

Fighter 

690 

(14.54) 

397 

(15.71) 

96 

(17.37) 

14 

(14.46) 

19 

(59.09) 

Non-Tech 

1,407 

(29.67) 

911 

(36.05) 

176 

(31.94) 

27 

(27.71) 

41 

(12.73) 

Tech¬ 

nical 

1,759 

(37.07) 

511 

(20.20) 

146 

(26.54) 

29 

(30.12) 

27 

(21.82) 

Totals 

4,744 

(100.0) 

2,528 

(100.0) 

551 

(100.0) 

97 

(100.0) 

122 

(100.0) 
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